(57) Abstract: The present invention provides a novel cellulase nucleic acid sequence, designated 029cel, and the corresponding
029cel amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence 
encoding 029cel, recombinant 029cel proteins and methods for producing the same. 
NOVEL BACILLUS 029cel CELLULASE

CROSS-REFERENCE TO RELATED APPLICATIONS 
[01] This application claims priority to U.S. Provisional Application No. 60/466,831 filed 
April 29, 2003 (Attorney Docket No. GC796P), which is herein incorporated in its entirety by 
reference.

STATEMENT AS TO RIGHTS TO INVENTIONS MADE UNDER FEDERALLY 
SPONSORED RESEARCH AND DEVELOPMENT 

[02] Not applicable. 

FIELD OF THE INVENTION

[03] This invention relates to a novel cellulase referred to herein as 029cel. Also 
described are nucleic acids encoding the cellulase, compositions comprising said cellulase, 
methods of identifying novel cellulases and methods of using said compositions. Preferably 
the cellulase(s) are isolated from Bacillus species, preferably B. agaradhaerens. The
present invention further relates to the use of the novel cellulase in compositions recognized
in the art as advantageously having cellulase added thereto, including, as an additive in a 
detergent composition, in the treatment of cellulose containing fabrics, in the treatment of 
pulp and paper and in the treatment of starch for the production of high fructose corn-syrup 
or ethanol. 

BACKGROUND OF THE INVENTION

[04] Cellulose and hemicellulose are the most abundant plant materials produced by 
photosynthesis. They can be degraded and used as an energy source by numerous 
microorganisms, including bacteria, yeast and fungi, that produce extracellular enzymes 
capable of hydrolysis of the polymeric substrates to monomeric sugars (Aro et al., 2001). As
the limits of non-renewable resources approach, the potential of cellulose to become a major
renewable energy resource is enormous (Krishna et al., 2001). The effective utilization of
cellulose through biological processes is one approach to overcoming the shortage of foods,
feeds, and fuels (Ohmiya et al., 1997).

[05] Cellulases are enzymes that hydrolyze cellulose (beta-1,4-glucan or beta D-glucosidic
linkages) resulting in the formation of glucose, cellobiose, cellooligosaccharides,
and the like. Cellulases have been traditionally divided into three major classes:
endoglucanases (EC 3.2.1.4) ("EG"), exoglucanases or cellobiohydrolases (EC 3.2.1.91)
("CBH") and beta-glucosidases (beta-D-glucoside glucohydrolase; EC 3.2.1.21) ("BG")
(Knowles et al., 1987; Schulein, 1988). Endoglucanases act mainly on the amorphous parts
of the cellulose fibre, whereas cellobiohydrolases are also able to degrade crystalline
cellulose (Nevalainen and Penttila, 1995). Thus, the presence of a cellobiohydrolase in a
cellulase system is required for efficient solubilization of crystalline cellulose (Suurnakki et
al., 2000). Beta-glucosidase acts to liberate D-glucose units from cellobiose,
cello-oligosaccharides, and other glucosides (Freer, 1993).

[06] In order to efficiently convert crystalline cellulose to glucose the complete cellulase
system comprising components from each of the CBH, EG and BG classifications is
required, with isolated components less effective in hydrolyzing crystalline cellulose (Filho et
al., 1996). A synergistic relationship has been observed between cellulase components
from different classifications. In particular, the EG-type cellulases and CBH-type cellulases
synergistically interact to more efficiently degrade cellulose. See, e.g., Wood, 1985.

[07] Although cellulase compositions have been previously described, there remains a
need for new and improved cellulase compositions for use in household detergents,
stonewashing compositions or laundry detergents, etc. Cellulases that exhibit improved
performance are of particular interest.

BRIEF SUMMARY OF THE INVENTION 

[08] It is an object of the present invention to provide a novel cellulase having beneficial 
properties for use in detergents, treating textiles, biomass conversion and pulp and paper
manufacturing. 

[09] It is an object of the present invention to provide polypeptides having cellulolytic 
activity and polynucleotides encoding the polypeptides. The polypeptides may improve the 
degradation of cell wall material, e.g., cellulose and/or hemicellulose. The polypeptides may 
also improve the stability or activity of other enzymes involved in the degradation of plant cell
wall material, e.g., biomass. 

[10] An object of the present invention is to provide a novel cellulase and derivatives
thereof, methods of producing such cellulases, and compositions comprising such novel
cellulases. The present invention further relates to the use of the novel cellulase and
derivatives thereof in compositions recognized in the art as advantageously having cellulase
added thereto, including, as an additive in a detergent composition, in the treatment of
textiles such as cellulose-containing fabrics and fibers useful therefor, as an animal feed
additive, in biomass conversion, in the treatment of pulp and paper and in the treatment of
starch for the production of high fructose corn-syrup or ethanol.

[11] It is a further object of the present invention to provide for a method of producing a
novel cellulase via heterologous expression from recombinant host cells.

[12] It is yet a further object of the present invention to provide a nucleic acid sequence
encoding the inventive cellulase. In one aspect, the nucleic acid and amino acid sequence 
facilitate commercial production of the novel cellulase and cellulase compositions of the 
invention. 

[13] It is still a further object of the present invention to provide a novel cellulase having
excellent properties for use in detergents, treating textiles, as a feed supplement and in pulp
and paper manufacturing. In a further aspect, the cellulase finds use in biomass conversion.

[14] In a first aspect, the invention includes an isolated polynucleotide having a sequence
which encodes 029cel, a sequence complementary to the 029cel gene coding sequence,
and a composition comprising the polynucleotide. The polynucleotide may be mRNA, DNA,
cDNA, genomic DNA, or an antisense analog thereof.

[15] In one embodiment, a 029cel polynucleotide may comprise an isolated nucleic acid
molecule which hybridizes to the complement of the nucleic acid presented as SEQ ID NO:2
under moderate to high stringency conditions, where the nucleic acid molecule encodes a
029cel polypeptide that exhibits cellulose binding activity.

[16] The polynucleotide having at least 80%, 85%, 90%, 95%, 98% or more sequence
identity to the sequence presented as SEQ ID NO:2 may encode a 029cel protein. In a
specific embodiment, the polynucleotide comprises a sequence substantially identical to
SEQ ID NO:2. The invention also contemplates fragments of the polynucleotide, preferably
at least about 15-30 nucleotides in length.

[17] In a second aspect, a novel cellulase or a derivative is provided which is obtainable
from a Bacillus. Preferably, the cellulase of the invention comprises an amino acid
sequence according to Figure 3 (SEQ ID NO:3), a fragment, or a derivative thereof, having
greater than 90% sequence identity, preferably greater than 95% sequence identity and
more preferably greater than 97% sequence identity to an active portion thereof.

[18] In a third aspect the present invention relates to a nucleic acid construct comprising 
the nucleotide sequence, which encodes for the polypeptide of the invention, operably linked 
to one or more control sequences that direct the production of the polypeptide in a suitable 
host. 

[19] The invention further provides recombinant expression vectors containing a nucleic
acid sequence encoding 029cel or a fragment or splice variant thereof, operably linked to 
regulatory elements effective for expression of the protein in a selected host. In a related 
aspect, the invention includes a host cell containing the vector. 

[20] In a fourth aspect the present invention relates to a recombinant expression vector 
comprising the nucleic acid construct of the invention.

[21] In a fifth aspect the present invention relates to a recombinant host cell comprising 
the nucleic acid construct of the invention. 
[22] The invention further includes a method for producing 029cel by recombinant
techniques, by culturing recombinant prokaryotic or eukaryotic host cells comprising a nucleic
acid sequence encoding 029cel under conditions effective to promote expression of the
protein, and subsequent recovery of the protein from the host cell or the cell culture medium.

[23] In a sixth aspect the present invention relates to a method for producing a
polypeptide of the invention, the method comprising: (a) cultivating a microorganism, which
in its wild-type form is capable of producing the polypeptide, to produce the polypeptide; and
(b) recovering the polypeptide.

[24] In a seventh aspect the invention provides for an enzymatic composition useful in the
conversion of cellulose to ethanol. In a preferred embodiment the enzymatic composition
comprises 029cel. The composition may further comprise additional cellulase enzymes such
as endoglucanases and/or cellobiohydrolases. The composition may be enriched in 029cel.

[25] In one embodiment the invention provides a method of identifying novel enzymes by
isolating total microbial community DNA from an environment, constructing a genomic DNA
library in E. coli, screening the library for expression of cellulase activity, identifying the
cellulase gene in the cellulase-positive clone and characterising the novel cellulase enzyme.

[26] Analytical methods for detecting 029cel nucleic acids and 029cel proteins also form
part of the invention.

[27] According to yet another embodiment of the invention, a method of transforming a 
suitable microorganism with nucleic acid sequence encoding a cellulase according to the
invention is provided. A method of producing the cellulase according to the invention from 
said transformed microorganism is provided. 

[28] A further object of the invention is to provide an expression vector particularly
effective in Streptomyces. Streptomyces serve as alternate host cells for the production of
various proteins and, with respect to the expression and production of cellulases, may offer a
number of advantages over Bacillus host cells, particularly when cells are grown at a high cell
density. A preferred expression vector comprises a regulatory polynucleotide sequence
including a promoter sequence derived from a glucose isomerase gene of Actinoplanes, a
signal sequence derived from a Streptomyces cellulase gene, and a DNA sequence
encoding a cellulase, particularly a cellulase according to the invention.

[29] In a preferred embodiment of the present invention, a full-length cellulase is 
obtainable from Bacillus. 

[30] Other objects, features and advantages of the present invention will become 
apparent from the following detailed description. It should be understood, however, that the 
detailed description and specific examples, while indicating preferred embodiments of the
invention, are given by way of illustration only, since various changes and modifications
within the scope and spirit of the invention will become apparent to one skilled in the art from
this detailed description. 



[31] Figure 1 illustrates the environmental nucleotide sequence (SEQ ID NO:1).

[32] Figure 2 illustrates a nucleic acid sequence encoding the novel cellulase (SEQ ID 
NO:2). 

[33] Figure 3 illustrates the deduced amino acid sequence of the inventive cellulase (SEQ 
ID NO:3). 



[34] The invention will now be described in detail by way of reference only using the 
following definitions and examples. All patents and publications, including all sequences 
disclosed within such patents and publications, referred to herein are expressly incorporated
by reference. 

[35] Unless defined otherwise herein, all technical and scientific terms used herein have
the same meaning as commonly understood by one of ordinary skill in the art to which this
invention belongs. Singleton, et al., Dictionary of Microbiology and Molecular
Biology, 2d Ed., John Wiley and Sons, New York (1994), and Hale & Marham, The Harper
Collins Dictionary of Biology, Harper Perennial, NY (1991) provide one of skill with a
general dictionary of many of the terms used in this invention. Although any methods and
materials similar or equivalent to those described herein can be used in the practice or
testing of the present invention, the preferred methods and materials are described.
Numeric ranges are inclusive of the numbers defining the range. Unless otherwise
indicated, nucleic acids are written left to right in 5' to 3' orientation; amino acid sequences
are written left to right in amino to carboxy orientation, respectively. Practitioners are
particularly directed to Sambrook et al., 1989, and Ausubel FM et al., 1993, for definitions
and terms of the art. It is to be understood that this invention is not limited to the particular
methodology, protocols, and reagents described, as these may vary.

[36] The headings provided herein are not limitations of the various aspects or
embodiments of the invention which can be had by reference to the specification as a whole.
Accordingly, the terms defined immediately below are more fully defined by reference to the
specification as a whole.

[37] All publications cited herein are expressly incorporated herein by reference for the 
purpose of describing and disclosing compositions and methodologies which might be used 
in connection with the invention.
I. DEFINITIONS

[38] "Cellulase," "cellulolytic enzymes" or "cellulase enzymes" means the inventive
bacterial endoglucanase described herein. Three different types of cellulase enzymes act 
synergistically to convert cellulose and its derivatives to glucose. 

[39] The term "cellulase" refers to a category of enzymes capable of hydrolyzing cellulose
polymers to shorter cello-oligosaccharide oligomers, cellobiose and/or glucose. Numerous
examples of cellulases, such as exoglucanases, exocellobiohydrolases, endoglucanases,
and glucosidases have been obtained from cellulolytic organisms, particularly including
fungi and bacteria. The enzymes made by these microbes are mixtures of proteins with
three types of actions useful in the conversion of cellulose to glucose: endoglucanases (EG),
cellobiohydrolases (CBH), and beta-glucosidase. These three different types of cellulase
enzymes act synergistically to convert cellulose and its derivatives to glucose.

[40] Many microbes make enzymes that hydrolyze cellulose, including the wood rotting
fungus Trichoderma; the compost bacteria Thermomonospora, Bacillus, and Cellulomonas;
Streptomyces; and the fungi Humicola, Aspergillus and Fusarium.

[41] By the term "host cell" is meant a cell that contains a vector and supports the
replication, and/or transcription or transcription and translation (expression) of the
expression construct. Host cells for use in the present invention can be prokaryotic cells,
such as E. coli, or eukaryotic cells such as yeast, plant, insect, amphibian, or mammalian
cells. In one embodiment according to the present invention, "host cell" means the cells of
the genus Bacillus. In another preferred embodiment according to the invention, "host cell"
means the cells of Streptomyces. A Streptomyces means any bacterial strain that is a
member of the genus Streptomyces as classified in Buchanan et al., The Shorter Bergey's
Manual For Determinative Bacteriology (Williams & Wilkins 1982). Particularly preferred
strains of Streptomyces include S. lividans, S. rubiginosus, and S. coelicolor. S. lividans is
described in Lomovskaya et al., J. Virology 9:258 (1972). However, one of skill will realize
that any appropriate host cell, e.g., bacterial, fungal, eukaryotic and plant cell may be used.

[42] The term "recombinant" when used with reference, e.g., to a cell, or nucleic acid,
protein, or vector, indicates that the cell, nucleic acid, protein or vector, has been modified by
the introduction of a heterologous nucleic acid or protein or the alteration of a native nucleic
acid or protein, or that the cell is derived from a cell so modified. Thus, for example,
recombinant cells express genes that are not found within the native (non-recombinant) form
of the cell or express native genes that are otherwise abnormally expressed, under-expressed
or not expressed at all.

[43] The term "secretory signal sequence" denotes a DNA sequence that encodes a
polypeptide (a "secretory peptide" or "secretory signal peptide") that, as a component of a
larger polypeptide, directs the larger polypeptide through a secretory pathway of a cell in
which it is synthesized. The larger peptide is commonly cleaved to remove the secretory
peptide during transit through the secretory pathway to yield the secretory signal peptide and
a smaller peptide commonly referred to as the mature polypeptide.
[44] As used herein, the phrases "whole cellulase preparation" and "whole cellulase
composition" are used interchangeably and refer to both naturally occurring and non-naturally
occurring compositions. A "naturally occurring" composition is one produced by a
naturally occurring source and which comprises, for example, one or more
cellobiohydrolase-type, one or more endoglucanase-type, and one or more beta-glucosidase
components wherein each of these components is found at the ratio produced by the source.
Certain fungi produce complete cellulase systems which include exo-cellobiohydrolases or
CBH-type cellulases, endoglucanases or EG-type cellulases and beta-glucosidases or BG-type
cellulases (Schulein, 1988). However, sometimes these systems lack CBH-type
cellulases and bacterial cellulases also typically include little or no CBH-type cellulases. A
naturally occurring composition is one that is produced by an organism unmodified with
respect to the cellulolytic enzymes such that the ratio of the component enzymes is
unaltered from that produced by the native organism.

[45] A "non-naturally occurring" composition encompasses those compositions produced
by: (1) combining component cellulolytic enzymes either in a naturally occurring ratio or
non-naturally occurring, i.e., altered, ratio; or (2) modifying an organism to overexpress or
underexpress one or more cellulolytic enzymes; or (3) modifying an organism such that at
least one cellulolytic enzyme is deleted; or (4) modifying an organism to express a
heterologous component cellulolytic enzyme.

[46] As used herein, the term "promoter" refers to a nucleic acid sequence that functions
to direct transcription of a downstream gene. The promoter will generally be appropriate to
the host cell in which the target gene is being expressed. The promoter together with other
transcriptional and translational regulatory nucleic acid sequences (also termed "control
sequences") are necessary to express a given gene. In general, the transcriptional and
translational regulatory sequences include, but are not limited to, promoter sequences,
ribosomal binding sites, transcriptional start and stop sequences, translational start and stop
sequences, and enhancer or activator sequences. The promoter may be the promoter
normally associated with the downstream gene or it may be heterologous, i.e., from another
gene or another microorganism, as long as it functions to direct transcription of the gene. A
preferred promoter when the transformation host cell is Bacillus is the aprE promoter. In one
aspect the promoter is an inducible promoter. In one aspect, when the host cell is a filamentous
fungus, the promoter is the T. reesei cbh1 promoter which is deposited in GenBank under
Accession Number D86235. In another aspect the promoter is a cbh II or xylanase promoter
from T. reesei.

[47] A nucleic acid is "operably linked" when it is placed into a functional relationship with
another nucleic acid sequence. For example, DNA encoding a secretory leader, i.e., a 
signal peptide, is operably linked to DNA for a polypeptide if it is expressed as a preprotein 
that participates in the secretion of the polypeptide; a promoter or enhancer is operably
linked to a coding sequence if it affects the transcription of the sequence; or a ribosome
binding site is operably linked to a coding sequence if it is positioned so as to facilitate 
translation. Generally, "operably linked" means that the DNA sequences being linked are 
contiguous, and, in the case of a secretory leader, contiguous and in reading phase. 
However, enhancers do not have to be contiguous. Linking is accomplished by ligation at
convenient restriction sites. If such sites do not exist, the synthetic oligonucleotide adaptors
or linkers are used in accordance with conventional practice. 

[48] "DNA construct" or "DNA vector" means a nucleotide sequence which comprises one
or more DNA fragments encoding the novel cellulase. Included in "DNA vectors" are
"expression vectors." Typical expression vectors contain regulatory sequences such as
transcription and translation terminators, transcription and translation initiation sequences,
signal sequences, and promoters useful for regulation of the expression of the particular
nucleic acid. The term "promoter" is used in its ordinary sense to refer to a polynucleotide
sequence involved in the control of the initiation of transcription of a polynucleotide
sequence encoding a protein. A "signal sequence" refers to a signal peptide or a portion of a
protein that is capable of directing the transport of a desired protein in bioactive form from a
host. The mature form of an extracellular protein lacks the signal sequence which is cleaved
off during the secretion process. While not meant to limit the invention, the number of amino
acid residues in a signal peptide may be between about 5 and about 100 amino acid
residues. Signal sequences may be modified to provide for cloning sites that allow for the
ligation of DNA or insertion of DNA encoding a cellulase. The vectors optionally comprise
generic expression cassettes containing at least one independent terminator sequence,
sequences permitting replication of the cassette in prokaryotes, eukaryotes, or both (e.g.,
shuttle vectors) and selection markers for both prokaryotic and eukaryotic systems. Vectors
are suitable for replication and integration in prokaryotes, eukaryotes, or both. See, Gillman
and Smith, Gene 8:81-97 (1979); Roberts et al., Nature 328:731-734 (1987); Berger and
Kimmel, Guide To Molecular Cloning Techniques, Methods In Enzymology, vol 152,
Academic Press, Inc., San Diego, CA ("Berger"); Scheider, B., et al., Protein Expr. Purif.
6435:10 (1995); Sambrook et al., Molecular Cloning - A Laboratory Manual (2nd ed.)
Vol. 1-3, Cold Spring Harbor Publishing (1989) ("Sambrook"); and Current Protocols in
Molecular Biology, Ausubel et al. (eds.), Current Protocols, a joint venture between
Greene Publishing Associates, Inc. and John Wiley & Sons, Inc., (1997 Supplement)
("Ausubel"). Cloning vectors useful in Streptomyces are known and reference is made to
U.S. Patent Nos. 4,338,397; 4,411,994; 4,513,085; 4,513,086; 4,745,056; 5,514,590; and
5,622,866 and WO88/07079.

[49] As used herein, the term "gene" means the segment of DNA involved in producing a
polypeptide chain, that may or may not include regions preceding and following the coding
region, e.g. 5' untranslated (5' UTR) or "leader" sequences and 3' UTR or "trailer"
sequences, as well as intervening sequences (introns) between individual coding segments
(exons).

[50] The term "heterologous" when used with reference to portions of a nucleic acid
indicates that the nucleic acid comprises two or more subsequences that are not normally
found in the same relationship to each other in nature. For instance, the nucleic acid is
typically recombinantly produced, having two or more sequences, e.g., from unrelated genes
arranged to make a new functional nucleic acid, e.g., a promoter from one source and a
coding region from another source. Similarly, a heterologous protein will often refer to two or
more subsequences that are not found in the same relationship to each other in nature (e.g.,
a fusion protein).

[51] The term "% homology" is used interchangeably herein with the term "% identity"
and refers to the level of nucleic acid or amino acid sequence identity between the
nucleic acid sequence that encodes 029cel or the 029cel amino acid sequence, when
aligned using a sequence alignment program.

[52] For example, as used herein, 80% homology means the same thing as 80% sequence
identity determined by a defined algorithm, and accordingly a homologue of a given sequence
has greater than 80% sequence identity over a length of the given sequence. Exemplary
levels of sequence identity include, but are not limited to, 80, 85, 90, 95, 98% or more
sequence identity to a given sequence, e.g., the coding sequence for 029cel, as described
herein.
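
By way of illustration only, the "% identity" determination described above may be sketched as follows. The sketch assumes that the two sequences have already been aligned by a separate alignment program and are supplied as equal-length strings with gaps written as "-". The function names, and the convention of dividing identical positions by the number of alignment columns that are not gaps in both sequences, are assumptions made for this example and are not taken from the specification.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent identity between two pre-aligned, equal-length sequences.

    Assumed convention: identical positions divided by the number of
    alignment columns that are not gaps ('-') in both sequences.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    matches = 0
    columns = 0
    for a, b in zip(aligned_a.upper(), aligned_b.upper()):
        if a == "-" and b == "-":
            continue  # a gap-only column carries no information
        columns += 1
        if a == b:  # both-gap columns were skipped, so a match is a residue match
            matches += 1
    if columns == 0:
        raise ValueError("no comparable alignment columns")
    return 100.0 * matches / columns


def meets_identity_level(aligned_a: str, aligned_b: str, level: float = 80.0) -> bool:
    """True if the aligned pair reaches the stated identity level (default 80%)."""
    return percent_identity(aligned_a, aligned_b) >= level
```

For example, two aligned ten-residue sequences differing at a single position would be reported at 90% identity and would satisfy the exemplary 80% and 85% levels but not the 95% level.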

[53] Exemplary computer programs which can be used to determine identity between two
sequences include, but are not limited to, the suite of BLAST programs, e.g., BLASTN,
BLASTX, TBLASTX, BLASTP and TBLASTN, publicly available on the Internet at
www.ncbi.nlm.nih.gov/BLAST. See also, Altschul, et al., 1990 and Altschul, et al., 1997.

[54] Sequence searches are typically carried out using the BLASTN program when
evaluating a given nucleic acid sequence relative to nucleic acid sequences in the GenBank
DNA Sequences and other public databases. The BLASTX program is preferred for
searching nucleic acid sequences that have been translated in all reading frames against
amino acid sequences in the GenBank Protein Sequences and other public databases. Both
BLASTN and BLASTX are run using default parameters of an open gap penalty of 11.0, and
an extended gap penalty of 1.0, and utilize the BLOSUM-62 matrix. (See, e.g., Altschul, et
al., 1997.)

[55] A preferred alignment of selected sequences in order to determine "% identity"
between two or more sequences is performed using, for example, the CLUSTAL-W program
in MacVector version 6.5, operated with default parameters, including an open gap penalty
of 10.0, an extended gap penalty of 0.1, and a BLOSUM 30 similarity matrix.

[56] The terms "isolated" or "purified" as used herein refer to a nucleic acid or amino acid
that is removed from at least one component with which it is naturally associated.

[57] In the present context, the term "substantially pure polypeptide" means a polypeptide
preparation which contains at the most 10% by weight of other polypeptide material with
which it is natively associated (lower percentages of other polypeptide material are
preferred, e.g. at the most 8% by weight, at the most 6% by weight, at the most 5% by
weight, at the most 4% by weight, at the most 3% by weight, at the most 2% by weight, at the
most 1% by weight, and at the most 1/2% by weight). Thus, it is preferred that the substantially
pure polypeptide is at least 92% pure, i.e. that the polypeptide constitutes at least 92% by weight
of the total polypeptide material present in the preparation, and higher percentages are
preferred such as at least 94% pure, at least 95% pure, at least 96% pure,
at least 97% pure, at least 98% pure, at least 99% pure, and at least 99.5% pure. The
polypeptides disclosed herein are preferably in a substantially pure form. In particular, it is
preferred that the polypeptides disclosed herein are in "essentially pure form", i.e. that the
polypeptide preparation is essentially free of other polypeptide material with which it is
natively associated. This can be accomplished, for example, by preparing the polypeptide
by means of well-known recombinant methods. Herein, the term "substantially pure
polypeptide" is synonymous with the terms "isolated polypeptide" and "polypeptide in
isolated form".

[58] In general, nucleic acid molecules which encode the 029cel will hybridize, under
moderate to high stringency conditions to the sequence provided herein as SEQ ID NO:2
(the 029cel). However, in some cases a 029cel-encoding nucleotide sequence is employed
that possesses a substantially different codon usage, while the protein encoded by the
029cel-encoding nucleotide sequence has the same or substantially the same amino acid
sequence as the native protein. For example, the coding sequence may be modified to
facilitate faster expression of 029cel in a particular prokaryotic or eukaryotic expression
system, in accordance with the frequency with which a particular codon is utilized by the
host. Te'o, et al. (2000), for example, describes the optimization of genes for expression in
filamentous fungi.
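
By way of illustration only, the modification of codon usage without change to the encoded amino acid sequence may be sketched as follows. The sketch uses the standard genetic code; the table of host-preferred codons (HYPOTHETICAL_PREFERRED) and the short input sequence are invented for this example and do not represent the codon usage of any particular host or any portion of SEQ ID NO:2. This is a simplified one-codon-per-amino-acid substitution and is not the method of Te'o et al.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}

# Hypothetical host preferences (assumption for illustration only).
HYPOTHETICAL_PREFERRED = {"L": "CTG", "K": "AAA"}


def translate(cds: str) -> str:
    """Translate a coding sequence codon by codon ('*' marks a stop codon)."""
    cds = cds.upper()
    if len(cds) % 3:
        raise ValueError("coding sequence length must be a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


def recode(cds: str, preferred: dict) -> str:
    """Replace each codon with the host-preferred synonymous codon, if one is given."""
    cds = cds.upper()
    protein = translate(cds)  # also validates the input length
    codons = [cds[i:i + 3] for i in range(0, len(cds), 3)]
    recoded = "".join(preferred.get(aa, codon) for aa, codon in zip(protein, codons))
    # Guard: the recoded sequence must encode the same protein.
    if translate(recoded) != protein:
        raise ValueError("preferred codon table changes the encoded protein")
    return recoded
```

The final check reflects the requirement stated above that the protein encoded by the modified sequence has the same amino acid sequence as the native protein.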

[59] A nucleic acid sequence is considered to be "selectively hybridizable" to a reference
nucleic acid sequence if the two sequences specifically hybridize to one another under
moderate to high stringency hybridization and wash conditions. Hybridization conditions are
based on the melting temperature (Tm) of the nucleic acid binding complex or probe. For
example, "maximum stringency" typically occurs at about Tm-5°C (5° below the Tm of the
probe); "high stringency" at about 5-10° below the Tm; "moderate" or "intermediate
stringency" at about 10-20° below the Tm of the probe; and "low stringency" at about 20-25°
below the Tm. Functionally, maximum stringency conditions may be used to identify
sequences having strict identity or near-strict identity with the hybridization probe; while high
stringency conditions are used to identify sequences having about 80% or more sequence
identity with the probe.
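
By way of illustration only, the stringency categories described above may be expressed as a simple classification of the difference between the probe Tm and the hybridization temperature. Because the stated ranges are approximate ("about") and share endpoints, the assignment of the boundary values of 5, 10, 20 and 25 degrees to the more stringent category is an assumption made for this sketch, as are the function name and the labels returned outside the stated ranges.

```python
def stringency_class(tm_celsius: float, hybridization_celsius: float) -> str:
    """Classify hybridization stringency from degrees below the probe Tm.

    Boundary handling (5, 10, 20, 25 assigned to the more stringent class)
    is an assumption; the source gives only approximate ranges.
    """
    degrees_below_tm = tm_celsius - hybridization_celsius
    if degrees_below_tm < 0:
        return "above Tm"  # outside the ranges described in the text
    if degrees_below_tm <= 5:
        return "maximum"
    if degrees_below_tm <= 10:
        return "high"
    if degrees_below_tm <= 20:
        return "moderate"
    if degrees_below_tm <= 25:
        return "low"
    return "below low stringency range"  # outside the ranges described in the text
```

For instance, for a probe with a Tm of 70°C, hybridization at 62°C would fall in the high stringency category and hybridization at 55°C in the moderate category under these assumptions.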

[60] Moderate and high stringency hybridization conditions are well known in the art (see,
for example, Sambrook, et al., 1989, Chapters 9 and 11, and Ausubel, F.M., et al., 1993,
expressly incorporated by reference herein). An example of high stringency conditions
includes hybridization at about 42°C in 50% formamide, 5X SSC, 5X Denhardt's solution,
0.5% SDS and 100 μg/ml denatured carrier DNA followed by washing two times in 2X SSC
and 0.5% SDS at room temperature and two additional times in 0.1X SSC and 0.5% SDS at
42°C.

[61] As used herein, the terms "transformed", "stably transformed" or "transgenic" with
reference to a cell means the cell has a non-native (heterologous) nucleic acid sequence
integrated into its genome or as an episomal plasmid that is maintained through multiple
generations.

[62] As used herein, the term "expression" refers to the process by which a polypeptide is
produced based on the nucleic acid sequence of a gene. The process includes both
transcription and translation.

[63] The term "introduced", in the context of inserting a nucleic acid sequence into a cell,
means "transfection", "transformation" or "transduction" and includes reference to the
incorporation of a nucleic acid sequence into a eukaryotic or prokaryotic cell where the
nucleic acid sequence may be incorporated into the genome of the cell (for example,
chromosome, plasmid, plastid, or mitochondrial DNA), converted into an autonomous
replicon, or transiently expressed (for example, transfected mRNA).
[64] It follows that the term "029cel expression" refers to transcription and translation of
the 029cel cellulase gene, the products of which include precursor RNA, mRNA,
polypeptide, and post-translationally processed polypeptides. By way of example, assays for
029cel expression include Western blot for 029cel protein, Northern blot analysis and
reverse transcriptase polymerase chain reaction (RT-PCR) assays for 029cel mRNA, and
endoglucanase activity assays as described in Shoemaker S.P. and Brown R.D. Jr. (Biochim.
Biophys. Acta, 1978, 523:133-146) and Schulein (1988).

[65] As used herein, the term "surfactant" refers to any compound generally recognized in
the art as having surface active qualities. Thus, for example, surfactants comprise anionic,
cationic and nonionic surfactants such as those commonly found in detergents. Anionic
surfactants include linear or branched alkylbenzenesulfonates; alkyl or alkenyl ether sulfates
having linear or branched alkyl groups or alkenyl groups; alkyl or alkenyl sulfates;
olefinsulfonates; and alkanesulfonates. Ampholytic surfactants include quaternary
ammonium salt sulfonates, and betaine-type ampholytic surfactants. Such ampholytic
surfactants have both the positive and negative charged groups in the same molecule.
Nonionic surfactants may comprise polyoxyalkylene ethers, as well as higher fatty acid
alkanolamides or alkylene oxide adduct thereof, fatty acid glycerine monoesters, and the
like.

[66] As used herein, the term "cellulose containing fabric" refers to any sewn or unsewn
fabrics, yarns or fibers made of cotton or non-cotton containing cellulose or cotton or
non-cotton containing cellulose blends including natural cellulosics and manmade cellulosics
(such as jute, flax, ramie, rayon, and lyocell).

[67] As used herein, the term "cotton-containing fabric" refers to sewn or unsewn fabrics,
yarns or fibers made of pure cotton or cotton blends including cotton woven fabrics, cotton
knits, cotton denims, cotton yarns, raw cotton and the like.

[68] As used herein, the term "stonewashing composition" refers to a formulation for use
in stonewashing cellulose containing fabrics. Stonewashing compositions are used to
modify cellulose containing fabrics prior to sale, i.e., during the manufacturing process. In
contrast, detergent compositions are intended for the cleaning of soiled garments and are
not used during the manufacturing process.

[69] As used herein, the term "detergent composition" refers to a mixture which is
intended for use in a wash medium for the laundering of soiled cellulose containing fabrics.
In the context of the present invention, such compositions may include, in addition to
cellulases and surfactants, additional hydrolytic enzymes, builders, bleaching agents, bleach
activators, bluing agents and fluorescent dyes, caking inhibitors, masking agents, cellulase
activators, antioxidants, and solubilizers.

[70] As used herein, the terms "active" and "biologically active" refer to a biological activity
associated with a particular protein and are used interchangeably herein. For example, the
enzymatic activity associated with a protease is proteolysis and, thus, an active protease has
proteolytic activity. It follows that the biological activity of a given protein refers to any
biological activity typically attributed to that protein by those of skill in the art.
[71] When employed in enzymatic solutions, the 029cel component is generally added in
an amount sufficient to allow the highest rate of release of soluble sugars from the biomass.
The amount of 029cel component added depends upon the type of biomass to be
saccharified, which can be readily determined by the skilled artisan. However, when
employed, the weight percent of the 029cel component relative to any other cellulase type
components present in the cellulase composition is from preferably about 1, preferably about
5, preferably about 10, preferably about 15, or preferably about 20 weight percent to
preferably about 25, preferably about 30, preferably about 35, preferably about 40,
preferably about 45 or preferably about 50 weight percent. Furthermore, preferred ranges
may be about 0.5 to about 15 weight percent, about 0.5 to about 20 weight percent, from
about 1 to about 10 weight percent, from about 1 to about 15 weight percent, from about 1 to
about 20 weight percent, from about 1 to about 25 weight percent, from about 5 to about 20
weight percent, from about 5 to about 25 weight percent, from about 5 to about 30 weight
percent, from about 5 to about 35 weight percent, from about 5 to about 40 weight percent,
from about 5 to about 45 weight percent, from about 5 to about 50 weight percent, from
about 10 to about 20 weight percent, from about 10 to about 25 weight percent, from about
10 to about 30 weight percent, from about 10 to about 35 weight percent, from about 10 to
about 40 weight percent, from about 10 to about 45 weight percent, from about 10 to about
50 weight percent, from about 15 to about 20 weight percent, from about 15 to about 25
weight percent, from about 15 to about 30 weight percent, from about 15 to about 35 weight
percent, from about 15 to about 30 weight percent, from about 15 to about 45 weight
percent, from about 15 to about 50 weight percent.

II. MOLECULAR BIOLOGY 

[72] This invention relies on routine techniques in the field of recombinant genetics. Basic
texts disclosing the general methods of use in this invention include Sambrook et al.,
Molecular Cloning, A Laboratory Manual (2nd ed. 1989); Kriegler, Gene Transfer and
Expression: A Laboratory Manual (1990); and Ausubel et al., eds., Current Protocols in
Molecular Biology (1994).

[73] To obtain high level expression of a cloned gene, the heterologous gene is preferably
positioned about the same distance from the promoter as it is in the naturally occurring
cellulase gene. As is known in the art, however, some variation in this distance can be
accommodated without loss of promoter function.

[74] Those skilled in the art are aware that a natural promoter can be modified by
replacement, substitution, addition or elimination of one or more nucleotides without
changing its function. The practice of the invention encompasses and is not constrained by
such alterations to the promoter.

[75] The expression vector/construct typically contains a transcription unit or expression
cassette that contains all the additional elements required for the expression of the
heterologous sequence. A typical expression cassette thus contains a promoter operably
linked to the heterologous nucleic acid sequence and signals required for efficient
polyadenylation of the transcript, ribosome binding sites, and translation termination.
Additional elements of the cassette may include enhancers and, if genomic DNA is used as
the structural gene, introns with functional splice donor and acceptor sites.
[76] The practice of the invention is not constrained by the choice of promoter in the
genetic construct. The only constraint on the choice of promoter is that it is functional in the
host cell used. A preferred promoter when the transformation host cell is Bacillus is the aprE
promoter.

[77] In addition to a promoter sequence, the expression cassette should also contain a
transcription termination region downstream of the structural gene to provide for efficient
termination. The termination region may be obtained from the same gene as the promoter
sequence or may be obtained from different genes.

[78] The particular expression vector used to transport the genetic information into the cell
is not particularly critical. Any of the conventional vectors used for expression in eukaryotic
or prokaryotic cells may be used. Standard bacterial expression vectors include
bacteriophages λ and M13, as well as plasmids such as pBR322 based plasmids, pSKF,
pET23D, and fusion expression systems such as MBP, GST, and LacZ. Epitope tags can
also be added to recombinant proteins to provide convenient methods of isolation, e.g.,
c-myc.

[79] The elements that are typically included in expression vectors also include a replicon,
a gene encoding antibiotic resistance to permit selection of bacteria that harbor recombinant
plasmids, and unique restriction sites in nonessential regions of the plasmid to allow
insertion of heterologous sequences. The particular antibiotic resistance gene chosen is not
critical; any of the many resistance genes known in the art are suitable.
[80] The methods of transformation of the present invention may result in the stable
integration of all or part of the transformation vector into the genome of the filamentous
fungus. However, transformation resulting in the maintenance of a self-replicating
extra-chromosomal transformation vector is also contemplated.

[81] The gene encoding the cellulase of the present invention can be cloned using
phage (expression) vectors and E. coli host cells. (Alternatively, PCR cloning using
consensus primers designed on conserved domains may be used.) Applicants have
discovered that transformation of the gene encoding the cellulase of the present invention
and expression in E. coli results in an active protein. After a first cloning step in E. coli, a
cellulase gene according to the present invention can be transferred to a more preferred
industrial expression host such as Bacillus or Streptomyces species, a filamentous fungus
such as Aspergillus or Trichoderma, or a yeast such as Saccharomyces. High level
expression and secretion obtainable in these host organisms allows accumulation of the
cellulase in the fermentation medium from which it can subsequently be recovered.

[82] A preferred general transformation and expression protocol for protease deleted
Bacillus strains is provided in Ferrari et al., U.S. Patent No. 5,264,366, incorporated herein
by reference. Transformation and expression in Aspergillus is described in, for example,
Berka et al., U.S. Patent No. 5,364,770, incorporated herein by reference.
[83] Many standard transfection methods can be used to produce Trichoderma reesei cell
lines that express large quantities of the heterologous protein. Some of the published
methods for the introduction of DNA constructs into cellulase-producing strains of
Trichoderma include Lorito, Hayes, DiPietro and Harman, 1993, Curr. Genet. 24: 349-356;
Goldman, VanMontagu and Herrera-Estrella, 1990, Curr. Genet. 17:169-174; Penttila,
Nevalainen, Ratto, Salminen and Knowles, 1987, Gene 6: 155-164; for Aspergillus, Yelton,
Hamer and Timberlake, 1984, Proc. Natl. Acad. Sci. USA 81: 1470-1474; for Fusarium,
Bajar, Podila and Kolattukudy, 1991, Proc. Natl. Acad. Sci. USA 88: 8202-8212; for
Streptomyces, Hopwood et al., 1985, The John Innes Foundation, Norwich, UK; and for
Bacillus, Brigidi, DeRossi, Bertarini, Riccardi and Matteuzzi, 1990, FEMS Microbiol. Lett. 55:
135-138, all incorporated herein by reference.

[84] However, any of the well-known procedures for introducing foreign nucleotide
sequences into host cells may be used. These include the use of calcium phosphate
transfection, polybrene, protoplast fusion, electroporation, biolistics, liposomes,
microinjection, plasma vectors, viral vectors and any of the other well known methods for
introducing cloned genomic DNA, cDNA, synthetic DNA or other foreign genetic material into
a host cell (see, e.g., Sambrook et al., supra). Also of use is the Agrobacterium-mediated
transfection method described in U.S. Patent No. 6,255,115. It is only necessary that the
particular genetic engineering procedure used be capable of successfully introducing at least
one gene into the host cell capable of expressing the heterologous gene.
[85] After the expression vector is introduced into the cells, the transfected cells are
cultured under conditions favoring expression of genes under control of cellulase gene
promoter sequences. Large batches of transformed cells can be cultured as described
below. Finally, product is recovered from the culture using standard techniques.
[86] Thus, the invention herein provides for the expression and enhanced secretion of the
inventive cellulases whose expression is under control of cellulase gene promoter
sequences including naturally occurring cellulase genes, fusion DNA sequences, and
various heterologous constructs. The invention also provides processes for expressing and
secreting high levels of the inventive cellulases.

III. Identification of Nucleic Acids and Encoded Protein Sequences

[87] A genomic library from Bacillus agaradhaerens (DSM 8721) was prepared using
standard techniques known in the art. This organism produces an alkaline cellulase
(endo-1,4-beta-glucanase) belonging to cellulase family 5 of glycosyl hydrolases (endoglucanase
5A, EC 3.2.1.4, Swiss-Prot: O85465, entry name GUN5_BACAG, EBI accession number
AF067428), the gene for which is 1203 bp in length (Davies et al. 1998). Cellulase positive
clones were detected with an incidence of 1/3000 in the plate assay. In the process for
isolating a gene according to an aspect of the present invention, degenerate primers based
on the coding sequence for this enzyme were used. Unexpectedly, however, no PCR
product was obtained using primers known to amplify the known B. agaradhaerens cellulase.
The complete sequence of the insert coding for the cellulase was therefore determined by
primer walking.
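
As an aside on the degenerate primers mentioned above, the following is a minimal sketch of what "degenerate" means in practice: a primer written with IUPAC ambiguity codes stands for a pool of concrete sequences. The example primer is hypothetical and is not taken from the sequence listing or from the primers actually used.

```python
from itertools import product
from typing import List

# Standard IUPAC nucleotide ambiguity codes.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG", "Y": "CT", "S": "CG", "W": "AT",
    "K": "GT", "M": "AC",
    "B": "CGT", "D": "AGT", "H": "ACT", "V": "ACG",
    "N": "ACGT",
}


def degeneracy(primer: str) -> int:
    """Number of distinct sequences represented by a degenerate primer."""
    count = 1
    for base in primer.upper():
        count *= len(IUPAC[base])
    return count


def expand(primer: str) -> List[str]:
    """Enumerate every concrete sequence in the degenerate primer pool."""
    choices = [IUPAC[base] for base in primer.upper()]
    return ["".join(combo) for combo in product(*choices)]


# Hypothetical primer: N stands for any base, Y for C or T.
example = "GGNTAYTGG"
print(degeneracy(example))
print(expand(example))
```

Higher degeneracy widens the range of templates a primer pool can anneal to, at the cost of a lower effective concentration of each individual sequence.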

[88] The process for isolating a gene according to the second aspect of the present
invention makes use of its homology to a nucleotide sequence comprising all or part of the
nucleotide sequence of SEQ ID No:1 or SEQ ID No:2 as shown in the sequence listing.
Examples of such processes include:

a) screening a gene library which presumably contains a 029cel gene using the
nucleotide sequence as a probe.

b) preparing a primer based on the nucleotide sequence information, then
performing PCR using a sample which presumably contains a 029cel gene as
a template.

[89] More specifically, process a) above comprises:

a) preparing a gene library which presumably contains a cellulase gene,
screening the gene library using a nucleotide sequence comprising all or part
of the nucleotide sequence of SEQ ID No:2 as shown in the sequence listing
to select sequences which hybridize with the nucleotide sequence comprising
all or part of the nucleotide sequence of SEQ ID No:2 as shown in the
sequence listing from the gene library, then isolating the selected sequences,
and isolating a 029cel gene from the sequences which have been selected
and isolated from the gene library.
[90] The gene library may be a genomic DNA library or a cDNA library, and may be
prepared according to a known procedure.
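
The probe-based screening of a gene library described above is a wet-lab hybridization procedure. Purely as an illustration of the selection logic, the sketch below uses ungapped sequence identity as a crude computational stand-in for hybridization. The function names, the toy sequences and the 0.8 cut-off are assumptions of this sketch, and only one strand is compared.

```python
from typing import Dict, List


def best_window_identity(probe: str, insert: str) -> float:
    """Highest fraction of matching bases between the probe and any
    same-length, ungapped window of the insert (one strand only)."""
    probe, insert = probe.upper(), insert.upper()
    if not probe or len(insert) < len(probe):
        return 0.0
    best = 0
    for start in range(len(insert) - len(probe) + 1):
        window = insert[start:start + len(probe)]
        matches = sum(a == b for a, b in zip(probe, window))
        best = max(best, matches)
    return best / len(probe)


def screen_library(probe: str, library: Dict[str, str],
                   min_identity: float = 0.8) -> List[str]:
    """Return the names of clones whose insert contains a window that
    matches the probe at or above the chosen identity threshold."""
    return [name for name, seq in library.items()
            if best_window_identity(probe, seq) >= min_identity]


# Toy sequences for illustration only; not from the sequence listing.
probe = "ATGCATGCAT"
library = {
    "clone_a": "TTTATGCATGCATTTT",   # exact match to the probe
    "clone_b": "TTTATGCTTGCATTTT",   # one mismatch
    "clone_c": "GGGGGGGGGGGGGGGG",   # unrelated
}
print(screen_library(probe, library))
```

Real hybridization depends on temperature, salt and probe length rather than on a fixed identity threshold, so the cut-off here is only a rough analogy.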

IV. PROTEIN EXPRESSION 

[91] Proteins of the present invention are produced by culturing cells transformed with an
expression vector containing the inventive cellulase gene whose expression is under control
of promoter sequences. The present invention is particularly useful for enhancing the
intracellular and/or extracellular production of proteins. The protein may be homologous or
heterologous.

[92] Proteins of the present invention may also be modified in a way to form chimeric
molecules comprising a protein of interest fused to another, heterologous polypeptide or
amino acid sequence. In one embodiment, such a chimeric molecule comprises a fusion of
the protein of interest with a tag polypeptide which provides an epitope to which an anti-tag
antibody can selectively bind. The epitope tag is generally placed at the amino- or
carboxyl-terminus of the protein of interest.

[93] Various tag polypeptides and their respective antibodies are well known in the art.
Examples include poly-histidine (poly-his) or poly-histidine-glycine (poly-his-gly) tags; HIS6
and metal chelation tags; the flu HA tag polypeptide and its antibody 12CA5 (Field et al.,
Mol. Cell. Biol. 8:2159-2165 (1988)); the c-myc tag and the 8F9, 3C7, 6E10, G4, B7 and
9E10 antibodies thereto (Evan et al., Molecular and Cellular Biology 5:3610-3616 (1985));
and the Herpes Simplex virus glycoprotein D (gD) tag and its antibody (Paborsky et al.,
Protein Engineering 3(6):547-553 (1990)). Other tag polypeptides include the FLAG-peptide
(Hopp et al., BioTechnology 6:1204-1210 (1988)); the KT3 epitope peptide (Martin et al.,
Science 255:192-194 (1992)); tubulin epitope peptide (Skinner et al., J. Biol. Chem.
266:15163-15166 (1991)); and the T7 gene 10 protein peptide tag (Lutz-Freyermuth et al.,
Proc. Natl. Acad. Sci. USA 87:6393-6397 (1990)).

[94] Conditions appropriate for expression of said 029cel gene comprise providing to the
culture the components necessary for growth and/or expression of the inventive cellulase.
Optimal conditions for the production of the proteins will vary with the choice of the host cell,
and with the choice of protein to be expressed. Such conditions will be easily ascertained by
one skilled in the art through routine experimentation or optimization.
[95] The protein of interest is typically purified or isolated after expression. The protein of
interest may be isolated or purified in a variety of ways known to those skilled in the art
depending on what other components are present in the sample. Standard purification
methods include electrophoretic, molecular, immunological and chromatographic techniques,
including ion exchange, hydrophobic, affinity, and reverse-phase HPLC chromatography,
and chromatofocusing. For example, the protein of interest may be purified using a standard
anti-protein of interest antibody column. Ultrafiltration and diafiltration techniques, in
conjunction with protein concentration, are also useful. For general guidance in suitable
purification techniques, see Scopes, Protein Purification (1982). The degree of purification
necessary will vary depending on the use of the protein of interest. In some instances no
purification will be necessary.

V. Utility of cellulase

[96] Treatment of textiles according to the present invention contemplates textile
processing or cleaning with a composition comprising the cellulase of this invention. Such
treating includes, but is not limited to, stonewashing, modifying the texture, feel and/or
appearance of cellulose-containing fabrics or other techniques used during manufacturing or
cleaning/reconditioning of cellulose-containing fabrics. Additionally, treating within the
context of this invention contemplates the removal of "immature" or "dead" cotton from
cellulosic fabric or fibers. Immature cotton is significantly more amorphous than mature
cotton and is undesirable because of, for example, uneven dyeing. The composition
contemplated in the present invention further includes a cellulase component for use in
washing a soiled manufactured cellulose-containing fabric. For example, a cellulase of this
invention may be used in a detergent composition for washing laundry. Detergent
compositions useful in accordance with the present invention include special formulations
such as pre-wash, pre-soak and home-use color restoration compositions. Such treating
compositions, as described herein, may be in the form of a concentrate which requires
dilution or in the form of a dilute solution or a form which can be applied directly to the
cellulose-containing fabric. General treatment techniques for cellulase treatment of textiles
are described in, for example, EP Publication No. 220 016 and GB Application Nos.
1,368,599 and 2,095,275.
[97] Treatment of a cellulosic material according to the present invention further
contemplates the treatment of animal feed, pulp and/or paper, food and grain for purposes
known in the art. For example, cellulases are known to increase the value of animal feed,
improve the drainability of wood pulp, enhance food products and reduce fiber in grain
during the grain wet milling process or dry milling process.

[98] Treating according to the instant invention comprises preparing an aqueous solution
which contains an effective amount of a cellulase or a combination of cellulases together
with other optional ingredients including, for example, a buffer, a surfactant, and/or a
scouring agent. An effective amount of a cellulase enzyme composition is a concentration of
cellulase enzyme sufficient for its intended purpose. Thus, for example, an "effective
amount" of cellulase in a stonewashing composition according to the present invention is
that amount which will provide the desired effect, e.g., to produce a worn and faded look in
seams and on fabric panels. Similarly, an "effective amount" of cellulase in a composition
intended for improving the feel and/or appearance of a cellulose-containing fabric is the
amount that produces measurable improvements in the feel, e.g., improving the smoothness
of the fabric, or appearance, e.g., removing pills and fibrils which tend to reduce the
sharpness in appearance of a fabric. The amount of cellulase employed is also dependent
on the equipment employed, the process parameters employed (the temperature of the
cellulase treatment solution, the exposure time to the cellulase solution, and the like), and
the cellulase activity (e.g., a particular solution will require a lower concentration of cellulase
where a more active cellulase composition is used as compared to a less active cellulase
composition). The exact concentration of cellulase in the aqueous treatment solution to
which the fabric to be treated is added can be readily determined by the skilled artisan based
on the above factors as well as the desired result. In stonewashing processes, it has
generally been preferred that the cellulase be present in the aqueous treating solution in a
concentration of from about 0.5 to 5,000 ppm and most preferably about 10 to 200 ppm total
protein. In compositions for the improvement of feel and/or appearance of a cellulose-
containing fabric, it has generally been preferred that the cellulase be present in the aqueous
treating solution in a concentration of from about 0.1 to 2000 ppm and most preferably about
0.5 to 200 ppm total protein.
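
To make the ppm figures above concrete, the following minimal sketch converts a target concentration into a mass of total protein for a given bath and checks a concentration against the stated ranges. It assumes that ppm means mg of protein per kg of solution and that the bath has a density of about 1 kg/L; neither assumption is stated in the text, and all names are hypothetical.

```python
# Concentration ranges (ppm total protein) as stated in the text.
RANGES = {
    "stonewashing": {"broad": (0.5, 5000.0), "most_preferred": (10.0, 200.0)},
    "feel_appearance": {"broad": (0.1, 2000.0), "most_preferred": (0.5, 200.0)},
}


def protein_mass_g(bath_volume_l: float, target_ppm: float,
                   density_kg_per_l: float = 1.0) -> float:
    """Grams of total protein needed to reach target_ppm in the bath,
    taking ppm as mg protein per kg solution (an assumption)."""
    bath_mass_kg = bath_volume_l * density_kg_per_l
    return target_ppm * bath_mass_kg / 1000.0  # mg converted to g


def tier(application: str, ppm: float) -> str:
    """Place a concentration within the ranges given for an application."""
    low, high = RANGES[application]["most_preferred"]
    if low <= ppm <= high:
        return "most preferred"
    low, high = RANGES[application]["broad"]
    if low <= ppm <= high:
        return "preferred"
    return "outside stated ranges"


# Example: a 100 L stonewashing bath dosed at 50 ppm total protein.
print(protein_mass_g(100, 50), tier("stonewashing", 50))
```

The ranges are described with "about" in the text, so the sharp comparisons in `tier` are a simplification.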

[99] In a preferred treating embodiment, a buffer is employed in the treating composition
such that the concentration of buffer is sufficient to maintain the pH of the solution within the
range wherein the employed cellulase exhibits activity. The pH at which the cellulase
exhibits activity depends on the nature of the cellulase employed. The exact concentration
of buffer employed will depend on several factors which the skilled artisan can readily take
into account. For example, in a preferred embodiment, the buffer as well as the buffer
concentration are selected so as to maintain the pH of the final cellulase solution within the
pH range required for optimal cellulase activity. The determination of the optimal pH range
of the cellulases of the invention can be ascertained according to well-known techniques.
Suitable buffers at pH within the activity range of the cellulase are also well known to those
skilled in the art.

[100] In addition to cellulase and a buffer, the treating composition may optionally contain a
surfactant. Suitable surfactants include any surfactant compatible with the cellulase being
utilized and the fabric including, for example, anionic, non-ionic and ampholytic surfactants.
Suitable anionic surfactants include, but are not limited to, linear or branched
alkylbenzenesulfonates; alkyl or alkenyl ether sulfates having linear or branched alkyl groups
or alkenyl groups; alkyl or alkenyl sulfates; olefinsulfonates; alkanesulfonates and the like.
Suitable counter ions for anionic surfactants include, but are not limited to, alkali metal ions
such as sodium and potassium; alkaline earth metal ions such as calcium and magnesium;
ammonium ion; and alkanolamines having 1 to 3 alkanol groups of carbon number 2 or 3.
Ampholytic surfactants include, e.g., quaternary ammonium salt sulfonates, and betaine-type
ampholytic surfactants. Such ampholytic surfactants have both the positive and negative
charged groups in the same molecule. Nonionic surfactants generally comprise
polyoxyalkylene ethers, as well as higher fatty acid alkanolamides or alkylene oxide adduct
thereof, and fatty acid glycerine monoesters. Mixtures of surfactants can also be employed
in manners known to those skilled in the art.

[101] A concentrated cellulase composition can be prepared for use in the methods
described herein. Such concentrates contain concentrated amounts of the cellulase
composition described above, buffer and surfactant, preferably in an aqueous solution.
When so formulated, the cellulase concentrate can readily be diluted with water so as to
quickly and accurately prepare cellulase preparations having the requisite concentration of
each constituent. When aqueous concentrates are formulated, these concentrates can be
diluted so as to arrive at the requisite concentration of the components in the cellulase
solution as indicated above. As is readily apparent, such cellulase concentrates permit facile
formulation of the cellulase solutions as well as permit feasible transportation of the
composition to the location where it will be used. The treating concentrate can be in any
art-recognized form, for example, liquid, emulsion, gel, or paste. Such forms are well known to
those skilled in the art.

[102] When a solid cellulase concentrate is employed, the cellulase composition may be a
granule, a powder, an agglomerate or a solid disk. The granules can be formulated so as to
contain materials to reduce the rate of dissolution of the granules into the wash medium.
Such materials and granules are disclosed in U.S. Patent No. 5,254,283 which is
incorporated herein by reference in its entirety.

[103] Other materials can also be used with or placed in the cellulase composition of the
present invention as desired, including stones, pumice, fillers, solvents, enzyme activators,
and anti-redeposition agents depending on the eventual use of the composition.
[104] By way of example, stonewashing methods will be described in detail; however, the
parameters described are readily modified by the skilled artisan for other applications, i.e.,
improving the feel and/or appearance of a fabric. The cellulose-containing fabric is
contacted with the cellulase containing stonewashing composition containing an effective
amount of the cellulase by intermingling the treating composition with the stonewashing
composition, and thus bringing the cellulase enzyme into proximity with the fabric.
Subsequently, the aqueous solution containing the cellulase and the fabric is agitated. If the
treating composition is an aqueous solution, the fabric may be directly soaked in the
solution. Similarly, where the stonewashing composition is a concentrate, the concentrate is
diluted into a water bath with the cellulose-containing fabric. When the stonewashing
composition is in a solid form, for example a pre-wash gel or solid stick, the stonewashing
composition may be contacted by directly applying the composition to the fabric or to the
wash liquor.

[105] The cellulose-containing fabric is incubated with the stonewashing solution under
conditions effective to allow the enzymatic action to confer a stonewashed appearance to
the cellulose-containing fabric. For example, during stonewashing, the pH, liquor ratio,
temperature and reaction time may be adjusted to optimize the conditions under which the
stonewashing composition acts. "Effective conditions" necessarily refers to the pH, liquor
ratio, and temperature which allow the cellulase enzyme to react efficiently with cellulose-
containing fabric, in this case to produce the stonewashed effect. It is within the skill of
those in the art to maximize conditions for using the stonewashing compositions according to
the present invention.

[106] The liquor ratio during stonewashing, i.e., the ratio of weight of stonewashing
composition solution (i.e., the wash liquor) to the weight of fabric, employed herein is
generally an amount sufficient to achieve the desired stonewashing effect in the denim fabric
and is dependent upon the process used. Preferably, the liquor ratios are from about 4:1 to
about 50:1; more preferably from about 5:1 to about 20:1, and most preferably from about
10:1 to about 15:1.
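
The liquor ratio defined above is a simple weight ratio, which the following minimal sketch computes and places within the stated preference ranges. The function names are hypothetical, and treating the "about" limits as sharp boundaries is an assumption made for illustration.

```python
def liquor_ratio(liquor_kg: float, fabric_kg: float) -> float:
    """Weight of wash liquor divided by weight of fabric."""
    if fabric_kg <= 0:
        raise ValueError("fabric weight must be positive")
    return liquor_kg / fabric_kg


def preference_tier(ratio: float) -> str:
    """Map a liquor ratio onto the ranges given in the text."""
    if 10 <= ratio <= 15:
        return "most preferred"
    if 5 <= ratio <= 20:
        return "more preferred"
    if 4 <= ratio <= 50:
        return "preferred"
    return "outside stated ranges"


# Example: 120 kg of wash liquor for 10 kg of denim gives a 12:1 ratio.
ratio = liquor_ratio(120, 10)
print(ratio, preference_tier(ratio))
```

Going the other way, the liquor needed for a chosen ratio is simply the fabric weight multiplied by that ratio.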

[107] Reaction temperatures during stonewashing with the present stonewashing
compositions are governed by two competing factors. Firstly, higher temperatures generally
correspond to enhanced reaction kinetics, i.e., faster reactions, which permit reduced
reaction times as compared to reaction times required at lower temperatures. Accordingly,
reaction temperatures are generally at least about 10°C and greater. Secondly, cellulase is
a protein which loses activity beyond a given reaction temperature, which temperature is
dependent on the nature of the cellulase used. Thus, if the reaction temperature is permitted
to go too high, the cellulolytic activity is lost as a result of the denaturing of the cellulase.
While standard temperatures for cellulase usage in the art are generally in the range of 35°C
to 65°C, and these conditions would also be expected to be suitable for the cellulase of the
invention, the optimal temperature conditions should be ascertained according to well known
techniques with respect to the specific cellulase used.

[108] Reaction times are dependent on the specific conditions under which the
stonewashing occurs. For example, pH, temperature and concentration of cellulase will all
affect the optimal reaction time. Generally, reaction times are from about 5 minutes to about
5 hours, and preferably from about 10 minutes to about 3 hours and, more preferably, from
about 20 minutes to about 1 hour.

[109] According to yet another preferred embodiment of the present invention, the cellulase
of the invention may be employed in a detergent composition. The detergent compositions
according to the present invention are useful as pre-wash compositions, pre-soak
compositions, or for cleaning during the regular wash or rinse cycle. Preferably, the
detergent composition of the present invention comprises an effective amount of cellulase, a
surfactant, and optionally includes other ingredients described below.
[110] An effective amount of cellulase employed in the detergent compositions of this
invention is an amount sufficient to impart the desirable effects known to be produced by
cellulase on cellulose-containing fabrics, for example, depilling, softening, anti-pilling,
surface fiber removal, anti-graying and cleaning. Preferably, the cellulase in the detergent
composition is employed in a concentration of from about 10 ppm to about 20,000 ppm of
detergent.






[111] The concentration of cellulase enzyme employed in the detergent composition is 
preferably selected so that upon dilution into a wash medium, the concentration of cellulase 
enzyme is in a range of about 0.01 to about 1000 ppm, preferably from about 0.02 ppm to 
about 500 ppm, and most preferably from about 0.5 ppm to about 250 ppm total protein. 
The amount of cellulase enzyme employed in the detergent composition will depend on the 
extent to which the detergent will be diluted upon addition to water so as to form a wash 
solution. 
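
As a rough illustration of the dilution relationship described above, the following Python sketch estimates the cellulase concentration in the wash medium from the concentration in the detergent and the detergent dose. The function names, the assumption that one litre of wash liquor weighs about one kilogram, and the example dose of 5 g of detergent per litre are illustrative assumptions and are not taken from this specification; only the 0.01 to 1000 ppm wash range comes from the text.

```python
# Broad wash-medium range stated in the text (ppm of cellulase protein).
WASH_RANGE_PPM = (0.01, 1000.0)


def wash_concentration_ppm(detergent_ppm: float, dose_g_per_l: float) -> float:
    """Estimate cellulase concentration (ppm) in the wash medium.

    detergent_ppm: cellulase in the detergent, in mg per kg of detergent.
    dose_g_per_l:  grams of detergent added per litre of wash water.
    Assumption: 1 L of wash liquor weighs about 1 kg, so g/L ~ g/kg.
    """
    if detergent_ppm < 0 or dose_g_per_l < 0:
        raise ValueError("inputs must be non-negative")
    # Mass fraction of detergent in the wash liquor (g per 1000 g).
    dilution_factor = dose_g_per_l / 1000.0
    return detergent_ppm * dilution_factor


def in_wash_range(ppm: float) -> bool:
    """Check a wash concentration against the broad range given in the text."""
    low, high = WASH_RANGE_PPM
    return low <= ppm <= high


if __name__ == "__main__":
    # Hypothetical example: 20,000 ppm in the detergent, dosed at 5 g/L.
    c = wash_concentration_ppm(20_000, 5)
    print(f"{c:.2f} ppm in wash, within range: {in_wash_range(c)}")
```

Under these assumptions, a detergent at the upper end of the stated range still falls inside the broad wash-medium range at a typical dose, which is consistent with the statement that the detergent concentration is chosen according to the expected dilution.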

[112] The detergent compositions of the present invention may be in any art recognized 
form, for example, as a liquid, in granules, in emulsions, in gels, or in pastes. Such forms 
are well known to the skilled artisan. When a solid detergent composition is employed, the 
cellulase is preferably formulated as granules. Preferably, the granules can be formulated 
so as to additionally contain a cellulase protecting agent. The granule can be formulated so 
as to contain materials to reduce the rate of dissolution of the granule into the wash medium. 
Such materials and granules are disclosed in U.S. Patent No. 5,254,283 which is 
incorporated herein by reference in its entirety. 

[113] The detergent compositions of this invention employ a surface active agent, i.e., 
surfactant, including anionic, non-ionic and ampholytic surfactants well known for their use in 
detergent compositions. 

[114] Suitable anionic surfactants for use in the detergent composition of this invention 
include linear or branched alkylbenzenesulfonates; alkyl or alkenyl ether sulfates having 
linear or branched alkyl groups or alkenyl groups; alkyl or alkenyl sulfates; olefinsulfonates; 
and alkanesulfonates. Suitable counter ions for anionic surfactants include alkali metal ions 
such as sodium and potassium; alkaline earth metal ions such as calcium and magnesium; 
ammonium ion; and alkanolamines having 1 to 3 alkanol groups of carbon number 2 or 3. 
Ampholytic surfactants include quaternary ammonium salt sulfonates, and betaine-type 
ampholytic surfactants. Such ampholytic surfactants have both the positive and negative 
charged groups in the same molecule. Nonionic surfactants generally comprise 
polyoxyalkylene ethers, as well as higher fatty acid alkanolamides or alkylene oxide adduct 
thereof, fatty acid glycerine monoesters, and the like. Suitable surfactants for use in this 
invention are disclosed in British Patent Application No. 2 094 826 A, the disclosure of which is 
incorporated herein by reference. Mixtures of such surfactants can also be used. The 
surfactant or a mixture of surfactants is generally employed in the detergent compositions of 
this invention in an amount from about 1 weight percent to about 95 weight percent of the 
total detergent composition and preferably from about 5 weight percent to about 45 weight 
percent of the total detergent composition. In addition to the cellulase composition and the 
surfactant(s), the detergent compositions of this invention can optionally contain one or more 
of the following components: 

Hydrolases Except Cellulase 

[115] Suitable hydrolases include carboxylate ester hydrolase, thioester hydrolase, 
phosphate monoester hydrolase, and phosphate diester hydrolase which act on the ester 
bond; glycoside hydrolase which acts on glycosyl compounds; an enzyme that hydrolyzes 
N-glycosyl compounds; thioether hydrolase which acts on the ether bond; and 
α-amino-acyl-peptide hydrolase, peptidyl-amino acid hydrolase, acyl-amino acid hydrolase, dipeptide 
hydrolase, and peptidyl-peptide hydrolase which act on the peptide bond. Preferable among 
them are carboxylate ester hydrolase, glycoside hydrolase, and peptidyl-peptide hydrolase. 
Suitable hydrolases include (1) proteases belonging to peptidyl-peptide hydrolase such as 
pepsin, pepsin B, rennin, trypsin, chymotrypsin A, chymotrypsin B, elastase, enterokinase, 
cathepsin C, papain, chymopapain, ficin, thrombin, fibrinolysin, renin, subtilisin, 
aspergillopeptidase A, collagenase, clostridiopeptidase B, kallikrein, gastrisin, cathepsin D, 
bromelin, keratinase, chymotrypsin C, pepsin C, aspergillopeptidase B, urokinase, 
carboxypeptidase A and B, and aminopeptidase; (2) glycoside hydrolases (cellulase, which is 
an essential ingredient, is excluded from this group) such as α-amylase, β-amylase, glucoamylase, 
invertase, lysozyme, pectinase, chitinase, and dextranase. Preferable among them are 
α-amylase and β-amylase. They function in acid to neutral systems, but one which is obtained 
from bacteria exhibits high activity in an alkaline system; (3) carboxylate ester hydrolase 
including carboxyl esterase, lipase, pectin esterase, and chlorophyllase. Especially effective 
among them is lipase. 

[116] The hydrolase other than cellulase is incorporated into the detergent composition as 
much as required according to the purpose. It should preferably be incorporated in an 
amount of 0.001 to 5 weight percent, and more preferably 0.02 to 3 weight percent, in terms 
of purified protein. This enzyme should be used in the form of granules made of crude 
enzyme alone or in combination with other components in the detergent composition. 
Granules of crude enzyme are used in such an amount that the purified enzyme is 0.001 to 
50 weight percent in the granules. The granules are used in an amount of 0.002 to 20 and 
preferably 0.1 to 10 weight percent. As with cellulases, these granules can be formulated so 
as to contain an enzyme protecting agent and a dissolution retardant material. 

Cationic Surfactants and Long-Chain Fatty Acid Salts 

[117] Such cationic surfactants and long-chain fatty acid salts include saturated or 
unsaturated fatty acid salts, alkyl or alkenyl ether carboxylic acid salts, α-sulfofatty acid salts 
or esters, amino acid-type surfactants, phosphate ester surfactants, quaternary ammonium 
salts including those having 3 to 4 alkyl substituents and up to 1 phenyl substituted alkyl 
substituents. Suitable cationic surfactants and long-chain fatty acid salts are disclosed in 
British Patent Application No. 2 094 826 A, the disclosure of which is incorporated herein by 
reference. The composition may contain from about 1 to about 20 weight percent of such 
cationic surfactants and long-chain fatty acid salts. 

Builders 

A. Divalent sequestering agents 

[118] The composition may contain from about 0 to about 50 weight percent of one or more 
builder components selected from the group consisting of alkali metal salts and alkanolamine 
salts of the following compounds: phosphates, phosphonates, phosphonocarboxylates, salts 
of amino acids, aminopolyacetates, high molecular electrolytes, non-dissociating polymers, 
salts of dicarboxylic acids, and aluminosilicate salts. Suitable divalent sequestering agents 
are disclosed in British Patent Application No. 2 094 826 A, the disclosure of which is 
incorporated herein by reference. 

B. Alkalis or inorganic electrolytes 

[119] The composition may contain from about 1 to about 50 weight percent, preferably 
from about 5 to about 30 weight percent, based on the composition of one or more alkali 
metal salts of the following compounds as the alkalis or inorganic electrolytes: silicates, 
carbonates and sulfates as well as organic alkalis such as triethanolamine, diethanolamine, 
monoethanolamine and triisopropanolamine. 

Antiredeposition Agents 

[120] The composition may contain from about 0.1 to about 5 weight percent of one or 
more of the following compounds as antiredeposition agents: polyethylene glycol, polyvinyl 
alcohol, polyvinylpyrrolidone and carboxymethylcellulose. 

[121] Among them, a combination of carboxymethylcellulose and/or polyethylene glycol 
with the cellulase composition of the present invention provides for an especially useful dirt 
removing composition. 

Bleaching Agents 

[122] The use of the cellulase of the present invention in combination with a bleaching 
agent such as potassium monopersulfate, sodium percarbonate, sodium perborate, sodium 
sulfate/hydrogen peroxide adduct and sodium chloride/hydrogen peroxide adduct and/or a 
photo-sensitive bleaching dye such as zinc or aluminum salt of sulfonated phthalocyanine 
further improves the detergenting effects. Similarly, bleaching agents and bleach catalysts 
as described in EP 684 304 may be used. 

Bluing Agents and Fluorescent Dyes 

[123] Various bluing agents and fluorescent dyes may be incorporated in the composition, 
if necessary. Suitable bluing agents and fluorescent dyes are disclosed in British Patent 
Application No. 2 094 826 A, the disclosure of which is incorporated herein by reference. 

Caking Inhibitors 

[124] The following caking inhibitors may be incorporated in the powdery detergent: 
p-toluenesulfonic acid salts, xylenesulfonic acid salts, acetic acid salts, sulfosuccinic acid salts, 
talc, finely pulverized silica, amorphous silicas, clay, calcium silicate (such as Micro-Cell of 
Johns Manville Co.), calcium carbonate and magnesium oxide. 

Antioxidants 

[125] The antioxidants include, for example, tert-butyl-hydroxytoluene, 
4,4'-butylidenebis(6-tert-butyl-3-methylphenol), 2,2'-butylidenebis(6-tert-butyl-4-methylphenol), 
monostyrenated cresol, distyrenated cresol, monostyrenated phenol, distyrenated phenol and 
1,1-bis(4-hydroxyphenyl)cyclohexane. 

Solubilizers 

[126] The solubilizers include, for example, lower alcohols such as ethanol, 
benzenesulfonate salts, lower alkylbenzenesulfonate salts such as p-toluenesulfonate salts, 
glycols such as propylene glycol, acetylbenzenesulfonate salts, acetamides, 
pyridinedicarboxylic acid amides, benzoate salts and urea. 

[127] The detergent composition of the present invention can be used in a broad pH range 
from acidic to alkaline pH. In a preferred embodiment, the detergent composition of the 
present invention can be used in mildly acidic, neutral or alkaline detergent wash media 
having a pH of from above 5 to no more than about 12. 

[128] Aside from the above ingredients, perfumes, buffers, preservatives, dyes and the like 
can be used, if desired, with the detergent compositions of this invention. Such components 
are conventionally employed in amounts heretofore used in the art. 

[129] When a detergent base used in the present invention is in the form of a powder, it 
may be one which is prepared by any known preparation method, including a spray-drying 
method and a granulation method. A detergent base obtained by the spray-drying method, 
agglomeration method, dry mixing method or non-tower route methods is particularly 
preferred. The detergent base obtained by the spray-drying method is not restricted with 
respect to preparation conditions. The detergent base obtained by the spray-drying method 
consists of hollow granules which are obtained by spraying an aqueous slurry of heat-resistant 
ingredients, such as surface active agents and builders, into a hot space. After the 
spray-drying, perfumes, enzymes, bleaching agents and inorganic alkaline builders may be added. 
With a highly dense, granular detergent base obtained such as by the spray-drying-granulation 
or agglomeration method, various ingredients may also be added after the 
preparation of the base. 

[130] When the detergent base is a liquid, it may be either a homogeneous solution or a 
nonhomogeneous dispersion. To avoid the decomposition of carboxymethylcellulose by 
the cellulase in the detergent, it is desirable that carboxymethylcellulose is granulated or 
coated before the incorporation in the composition. 

[131] The detergent compositions of this invention may be incubated with 
cellulose-containing fabric, for example soiled fabrics, in industrial and household uses at 
temperatures, reaction times and liquor ratios conventionally employed in these 
environments. 

[132] Detergents according to the present invention may additionally be formulated as a 
pre-wash in the appropriate solution at an intermediate pH where sufficient activity exists to 
provide desired improvements in softening, depilling, pilling prevention, surface fiber removal 
or cleaning. When the detergent composition is a pre-soak (e.g., pre-wash or pre-treatment) 
composition, either as a liquid, spray, gel or paste composition, the cellulase enzyme is 
generally employed from about 0.0001 to about 1 weight percent based on the total weight 
of the pre-soak or pre-treatment composition. In such compositions, a surfactant may 
optionally be employed and when employed, is generally present at a concentration of from 
about 0.005 to about 20 weight percent based on the total weight of the pre-soak. The 
remainder of the composition comprises conventional components used in the pre-soak, i.e., 
diluent, buffers, other enzymes (proteases), and the like at their conventional concentrations. 

[133] It is contemplated that compositions comprising cellulase enzymes described herein 
can be used in home use as a stand-alone composition suitable for restoring color to faded 
fabrics (see, for example, U.S. Patent No. 4,738,682, which is incorporated herein by 
reference in its entirety) as well as used in a spot-remover and for depilling and antipilling 
(pilling prevention). 

[134] The use of the cellulase according to the invention may be particularly effective in 
feed additives and in the processing of pulp and paper. These additional industrial 
applications are described in, for example, PCT Publication No. 95/16360 and Finnish 
Granted Patent No. 87372, respectively. 

[135] In order to further illustrate the present invention and advantages thereof, the 
following specific examples are given with the understanding that they are being offered to 
illustrate the present invention and should not be construed in any way as limiting its scope. 

EXAMPLES 

[136] The following examples are offered to illustrate, but not to limit the claimed invention. 

Example 1 
Sample collection and processing 

[137] This example illustrates how to collect samples and process them to obtain sufficient 
DNA to create a cDNA library. 

[138] Samples of water (250 ml) were collected from the littoral zone of Sonachi (Crater) 
Lake, Kenya using a 250-ml stainless steel beaker mounted on the end of a flexible 
extendible 1-m pole and placed in sealable plastic containers (Whirlpak) for transport to the 
laboratory at ambient temperature. The temperature of the surface waters was 28°C, with pH 
10 and a conductivity of 7.23 mS cm⁻¹ (at 27°C). 

[139] To collect the microbial flora, water (750 ml) from Sonachi (Crater) Lake, Kenya was 
filtered on site (using a hand operated vacuum pump) through a sequence of sterile 
membrane filters (47 mm diameter), composed of cellulose nitrate or cellulose acetate, of 
decreasing pore size, until all water flow stopped. The sequence of filters was 8 µm, 3 µm and 
0.22 µm. The individual membrane filters were placed immediately into 10 ml of cold, sterile 
cell stabilization buffer (TES) containing 10 mM Tris HCl, pH 8.0; 1 mM EDTA and 5% w/v 
NaCl in 30 ml sterile plastic universal tubes and kept on ice in a refrigerated cool box until 
they could be processed further, usually within 4 hours of sampling. The microbial material 
on the filters was dispersed by vigorous vortex mixing with sterile glass beads (5 ml) and the 
cells pelleted in microfuge tubes by centrifugation at 13,000 g for 5 min. The microbial 
material was aliquoted to the microfuge tubes in volumes estimated to contain the equivalent 
of 10^ to 10^ bacterial cells, giving a total of 12 tubes. The DNA was extracted using the 
GenomicPrep™ Cells and Tissue DNA isolation kit (Amersham Pharmacia Biotech, 
Piscataway, NJ, USA) following the manufacturer's instructions. Cells in each tube were 
resuspended in 600 µl of the Cell Lysis Solution provided, and incubated at 80°C for 5 min to 
lyse the cells. Samples prepared by this method are stable at room temperature for at least 
18 months, and were transported back to the laboratory in this form. DNA extraction was 
completed by RNase A treatment, protein precipitation and isopropanol precipitation of the 
DNA following the manufacturer's protocol. Each DNA pellet was dissolved in 100 µl sterile 
Tris buffer 10 mM pH 8.5. 

[140] DNA yield was estimated by running 5 µl samples on a 0.5% w/v agarose gel and 
comparing with known amounts of bacterial genomic DNA. The samples were pooled, giving 
a total of about 20 µg DNA. Since yields were low, the material was supplemented with about 
30% extra material extracted from the water samples which were collected at the same time 
as the on-site material and stored at 4°C in the laboratory until required. This amount of 
DNA, about 30 µg, was the amount of starting material that preliminary experiments had 
shown was needed to carry out the trial and bulk restriction digestion and size fractionation 
to give sufficient material for library construction. 

Example 2 
Library Construction 

[141] The following example details how to prepare a DNA library for use in screening and 
detection of novel sequences in E. coli. 

Preparation of DNA 

[142] The pooled DNA was used for construction of the genomic DNA library. The purified 
DNA was partially digested with Sau3AI to give an average fragment size of about 5 kb. 
Restricted DNA was size fractionated by electrophoresis on 0.5% agarose in TAE (0.04 M 
Tris-acetate, 0.001 M EDTA pH 8.0). Material in the 1.5 to 10 kb range was excised and 
replaced in a well of the same size cut in an unused part of the agarose gel and 
concentrated to a narrow band by reversed electrical current. The DNA band was excised 
and DNA extracted using the QIAGEN (Crawley, UK) QIAEX II gel extraction kit, following the 
manufacturer's guidelines. The eluted DNA was precipitated with ethanol and resuspended 
in 10 mM Tris HCl buffer, pH 8.5. 

Preparation of Lambda libraries 
[143] The restricted DNA was cloned into a Lambda vector using the ZAP-Express™ 
vector kit (predigested with BamHI and alkaline phosphatase treated) and the Gigapak® III 
Gold packaging extract (Stratagene, Amsterdam, The Netherlands) following the 
manufacturer's protocol. The primary libraries were amplified as per protocol by plating 
aliquots containing ~5 x 10^ pfu with host E. coli strain XL1-Blue MRF' on 150 mm Petri dishes 
and eluting the phage in buffer. Amplified libraries were stored in 7% v/v dimethyl sulphoxide 
at -80°C after freezing in liquid nitrogen. The total primary titre was 1.8 x 10^ pfu and after 
amplification 6.8 x 10® pfu ml⁻¹. 

Assessment of Library Quality 
[144] The phagemid vector pBK-CMV was excised from the Lambda ZAP library using 
ExAssist helper phage (Stratagene) as described by the manufacturer, and used to infect 
E. coli strain XLOLR. Plasmid-containing clones were isolated by plating on Luria-Bertani 
(LB) agar containing 50 µg ml⁻¹ kanamycin. Blue:white screening in the presence of Xgal 
[5-bromo-4-chloro-3-indolyl-β-D-galactoside] and IPTG [isopropylthio-β-D-galactoside] was 
used to determine cloning efficiency. If no DNA has been cloned into the Lambda vector, the 
β-galactosidase gene is expressed in the presence of the inducer IPTG, resulting in 
cleavage of the substrate analogue Xgal to produce a blue pigment in the colony. If however 
a fragment of the genomic DNA has been successfully cloned into the Lambda vector it 
disrupts the gene so that no enzyme is produced and the colony remains white. The ratio of 
blue to white colonies therefore can be used to calculate the percentage of clones containing 
an insert. For this library the blue:white screen gave a ratio of 7 blue to 286 white colonies, 
indicating that 97% of the clones contained an insert of the genomic DNA. Twenty four 
colonies were selected at random and plasmid DNA prepared using the Wizard® Plus SV 
Miniprep DNA purification system (Promega UK, Southampton). Restriction analysis using 
PstI and HindIII which flank the BamHI cloning site followed by agarose gel electrophoresis 
was used to determine insert sizes. One clone out of the 24 was found to have no detectable 
insert. The rest had inserts ranging from 1.5 kb to 8.0 kb. 
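
The percentage of insert-containing clones quoted above follows directly from the colony counts. The short Python sketch below reproduces that arithmetic; the function name is a hypothetical choice for illustration, and the only data used are the counts reported in this example (7 blue and 286 white colonies, and 23 of 24 randomly picked clones carrying a detectable insert).

```python
def insert_fraction(without_insert: int, with_insert: int) -> float:
    """Fraction of clones carrying an insert.

    In the blue:white screen, blue colonies have an intact
    beta-galactosidase gene (no insert) and white colonies have a
    disrupted gene (insert present).
    """
    total = without_insert + with_insert
    if total == 0:
        raise ValueError("no colonies counted")
    return with_insert / total


if __name__ == "__main__":
    # Blue:white screen: 7 blue, 286 white colonies.
    print(f"blue:white screen: {insert_fraction(7, 286):.1%}")
    # Restriction analysis: 1 of 24 clones had no detectable insert.
    print(f"restriction analysis: {insert_fraction(1, 23):.1%}")
```

The two independent estimates (blue:white screen and restriction analysis of 24 clones) give similar values, which supports the reported cloning efficiency.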

Example 3 
Library screening for cellulases 

[145] DNA libraries in the pBK-CMV phagemid were screened for cellulase activity in a 
plate assay of the E. coli clones. To detect cellulase activity the genomic libraries were 
plated on LB agar containing kanamycin, 0.5% w/v carboxymethylcellulose (low viscosity 
sodium salt; Sigma, Poole, UK) and IPTG (15 µl of a 0.5 M solution spread on the surface of 
the agar in a 7 cm diameter Petri dish). Following overnight growth at 37°C, the colonies 
were overlaid with 3 ml molten 0.7% w/v agarose dissolved in water which had been cooled 
to 50°C. After this had set, the plates were flooded with 0.1% w/v Congo Red solution for 30 
minutes followed by 2 washes with 1 M NaCl. Positive clones exhibiting extracellular 
cellulase activity were surrounded by a yellow halo against a red background (R. Teather 
and P.J. Wood, Applied & Environmental Microbiology, 43: 770-780, 1982). 

[146] The screening of 110,000 E. coli pBK-CMV clones yielded 4 zones of clearing 
indicating potential cellulase-producing colonies. Three of these were successfully 
recovered as cellulase-producing clones after homogenising the agar plug removed from the 
cleared zone, streaking out for single colonies and confirming the phenotype by the Congo 
Red test. 

Example 4 

Characterisation of a cellulase-positive clone 

[147] Plasmid DNA was isolated from the three cellulase positive clones, and the size of 
the inserts determined by restriction digestion as described above. All three had the same 
size (about 3.5 kb) and the same size fragments after digestion as determined by gel 
electrophoresis. This indicated that all three isolates were identical, derived by amplification 
of a single clone. This was confirmed by the first round of sequencing of the plasmid DNA 
(using primer sites in the pBK-CMV plasmid). This was carried out by the Protein and 
Nucleic Acid Chemistry Laboratory at Leicester University, using the Perkin Elmer 'BigDye' 
terminator chemistry and the model 377 ABI automated DNA sequencer. Complete 
coverage of the sequence was obtained by 'primer walking' from both the 5' and 3' ends of 
the insert. The sequence was edited using Applied Biosystems multisequence editor 
SeqEd™ version 1.0.3. Sequence was assembled with programmes in the GCG Wisconsin 
Package, version 10.2-UNIX, available at the University of Leicester. This identified an 
insert of environmental DNA of 3410 nucleotide bases (Figure 1). 

Example 5 
Identification of the cellulase-gene 

[148] Possible Open Reading Frames (ORF) in the nucleotide sequence of the inserted 
environmental DNA of clone 029cel were identified using the ORF Find facility of the 
MapDraw program (DNASTAR, Brighton, MA, USA) or ORF Search from the Vector NTI 
Suite of programs (InforMax®, North Bethesda, MD, USA). 
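
For readers unfamiliar with ORF searching, the following Python sketch shows the general idea of scanning all six reading frames for stretches that run from a start codon to an in-frame stop codon. It is a simplified illustration and not the algorithm used by MapDraw or Vector NTI, whose internals are not described here. The function names, the `min_codons` threshold and the restriction to ATG as the only start codon are assumptions made for brevity (bacterial genes may also start at GTG or TTG).

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")
STOP_CODONS = {"TAA", "TAG", "TGA"}


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an upper-case DNA string."""
    return seq.translate(COMPLEMENT)[::-1]


def find_orfs(seq: str, min_codons: int = 100):
    """Scan six reading frames for ATG...stop ORFs.

    Returns a list of (strand, start, end, n_codons) tuples. Coordinates
    are 1-based on the forward strand and include the stop codon, so for
    a reverse-strand ORF start > end. n_codons excludes the stop codon.
    """
    seq = seq.upper()
    n = len(seq)
    orfs = []
    for strand, s in (("+", seq), ("-", reverse_complement(seq))):
        for frame in range(3):
            start = None  # 0-based index of the current ATG, if any
            for i in range(frame, n - 2, 3):
                codon = s[i:i + 3]
                if start is None and codon == "ATG":
                    start = i
                elif start is not None and codon in STOP_CODONS:
                    n_codons = (i - start) // 3
                    if n_codons >= min_codons:
                        if strand == "+":
                            orfs.append((strand, start + 1, i + 3, n_codons))
                        else:
                            # Map reverse-strand indices back to forward coordinates.
                            orfs.append((strand, n - start, n - i - 2, n_codons))
                    start = None
    return orfs


if __name__ == "__main__":
    # Toy sequence: one two-codon ORF on the forward strand.
    print(find_orfs("ATGAAATAA", min_codons=1))
```

Reporting reverse-strand ORFs with a start coordinate larger than the end coordinate matches the convention used in this example, where the ORF starts at a higher insert position than it ends.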

[149] This identified an ORF composed of 1746 nucleotides corresponding to a protein of 
581 amino acids, starting at position 3004 of the insert sequence and ending at position 
1259. The sequence of this ORF was excised using EditSeq (DNASTAR) and examined by 
BLAST programs. 
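
The figures in this paragraph are internally consistent, as the following Python sketch shows. It assumes (this is an inference, not stated in the text) that the 1746 nucleotides include the stop codon and that a start position larger than the end position means the ORF lies on the reverse strand relative to the insert numbering. The function name is hypothetical.

```python
def orf_summary(start: int, end: int) -> dict:
    """Summarise an ORF from its 1-based, inclusive insert coordinates.

    Assumes the coordinates include the stop codon, so the encoded
    protein has one residue fewer than the number of codons.
    """
    length = abs(start - end) + 1
    if length % 3 != 0:
        raise ValueError("ORF length is not a multiple of three")
    codons = length // 3
    return {
        "strand": "-" if start > end else "+",
        "nucleotides": length,
        "amino_acids": codons - 1,  # stop codon is not translated
    }


if __name__ == "__main__":
    # Coordinates reported for the 029cel ORF.
    print(orf_summary(3004, 1259))
```

With positions 3004 and 1259 this gives 1746 nucleotides, 582 codons and hence 581 amino acids, in agreement with the values reported above.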

[150] The nucleotide sequence of this ORF is shown in Figure 2. 

[151] An examination of the nucleotide sequence using the BLASTn program, which 
compares a nucleotide query sequence against a non-redundant nucleotide sequence 
database, indicated no significant similarity to any known sequence. 

[152] An examination of the nucleotide sequence using the BLASTx program, which 
compares the six-frame conceptual translation products of a nucleotide query sequence 
(both strands) against a protein sequence database, revealed, surprisingly, very low similarity 
(no more than 29%) to a number of bacterial endocellulases. The highest alignment score 
revealed 25% identity (132 amino acids) to a 527 amino acid region of CelJ, an enzyme 
comprising 1601 amino acids, the largest catalytic component of the cellulosome of 
Clostridium thermocellum (protein id BAA1207070.1, accession D83704.1). It is very 
probable that an enzyme with homology this low would not have been detected using 
conventional methods using DNA probes based on known cellulase gene sequences, 
especially given the very high diversity of cellulases already characterised. 
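
To make the identity figure concrete, the Python sketch below computes percent identity from a pair of aligned sequences and checks the arithmetic reported above (132 identical residues over a 527-residue aligned region). The function name and the choice of denominator (all aligned columns, excluding columns that are gaps in both sequences) are assumptions for illustration; alignment programs differ in how they define the denominator, so values from different tools are not always directly comparable.

```python
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percent identity between two pre-aligned, equal-length sequences.

    Columns that are gaps in both sequences are ignored; a residue
    aligned to a gap counts as a mismatch.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    columns = [
        (x, y) for x, y in zip(aligned_a, aligned_b)
        if not (x == gap and y == gap)
    ]
    if not columns:
        raise ValueError("no aligned columns")
    matches = sum(1 for x, y in columns if x == y and x != gap)
    return 100.0 * matches / len(columns)


if __name__ == "__main__":
    # Toy alignment: 4 identical positions out of 6 columns.
    print(f"{percent_identity('ACDE-F', 'ACDQGF'):.1f}%")
    # Arithmetic check for the reported BLASTx hit: 132 of 527 positions.
    print(f"{100 * 132 / 527:.1f}%")
```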
[153] The translated protein composed of 581 amino acids is shown in Figure 3. 

Example 6 
Enzyme Characterization 

Influence of salt 

[154] Cells of E. coli pBK-CMV containing the 029cel gene are suspended in 5 ml buffer (20 
mM TRIS-HCl, pH 8.0; 500 mM NaCl; 0.1 mM EDTA; 0.1% Triton X-100) and disrupted by 
sonication on ice. The sonicated extracts are examined by agar diffusion assay on 
carboxymethylcellulose (CMC) at different NaCl concentrations. Sonicated extracts (100 µl) 
and 1 in 10 dilutions are placed in wells punched in CMC-agar plates containing varying 
amounts of NaCl. The plates are incubated at 37°C for 16 hours and the resulting clearing 
zones indicating cellulose hydrolysis measured in millimetres. The cellulase 029cel is active 
over the range 0 - 25% w/v NaCl, although the activity at 25% w/v NaCl is lower than the 
activity at 0% NaCl. 

Influence of pH 

[155] The influence of pH on cellulase activity is investigated using the pH-gradient plate 
method described by Grant & Tindall (Isolation of alkaliphilic bacteria. In: Microbial Growth 
and Survival in Extreme Environments, Academic Press, London, 1980, pp. 27-36). An agar 
medium containing CMC is poured to a depth of 1 cm in square Petri dishes and allowed to 
set. A uniform trough 1 cm wide is cut from one edge of the plate and agar containing 20% 
w/v Na2CO3·10H2O and 0.2 M NaOH (prepared by mixing equal volumes of sterile 0.4 M 
NaOH/40% w/v Na2CO3·10H2O and 4% w/v agar at 60°C) is poured into the trough. The 
plates are developed at 27°C overnight to allow a uniform gradient from pH 12 to pH 7 to 
form. To test the pH tolerance of the 029cel cellulase a narrow trough is cut through the 
(agar) gradient at right angles to the original trough and filled with 1 ml of sonicated cell 
extract. The plates are allowed to develop overnight at 37°C. The plates are treated with 
Congo Red for 30 minutes to visualize the zone of cellulose hydrolysis. The results indicate 
that the 029cel cellulase is active to about pH 11.5. 

[156] It is understood that the examples and embodiments described herein are for 
illustrative purposes only and that various modifications or changes in light thereof will be 
suggested to persons skilled in the art and are to be included within the spirit and purview of 
this application and scope of the appended claims. All publications, patents, and patent 
applications cited herein are hereby incorporated by reference in their entirety for all 
purposes. 

What is claimed is: 

1. An isolated polynucleotide selected from the group consisting of: 

(a) a nucleic acid sequence having at least 85% sequence identity to the sequence 
presented as SEQ ID NO:1, or the complement thereof; 

(b) a nucleic acid sequence which encodes or is complementary to a sequence 
which encodes an 029cel polypeptide having at least 85% sequence identity to 
the amino acid sequence presented in Figure 3 (SEQ ID NO:3); 

(c) a nucleic acid sequence which encodes or is complementary to a sequence 
which encodes an 029cel polypeptide having at least 90% sequence identity to 
the amino acid sequence presented in Figure 3 (SEQ ID NO:3); 

(d) a nucleic acid sequence which encodes or is complementary to a sequence 
which encodes an 029cel polypeptide having at least 95% sequence identity to 
the amino acid sequence presented in Figure 3 (SEQ ID NO:3); 

(e) a nucleic acid sequence which encodes or is complementary to a sequence 
which encodes an 029cel polypeptide having the amino acid sequence 
presented in Figure 3 (SEQ ID NO:3); 

wherein said isolated polynucleotide encodes a polypeptide having the biological activity of a 
cellulase and wherein the identity is determined by the CLUSTAL-W program in MacVector 
version 6.5, operated with default parameters, including an open gap penalty of 10.0, an 
extended gap penalty of 0.1, and a BLOSUM 30 similarity matrix. 

2. An isolated polynucleotide selected from the group consisting of: 

(a) a nucleic acid sequence presented as SEQ ID NO:1, or the complement thereof; 

(b) a nucleic acid sequence that hybridizes, under high stringency conditions to the 
sequence presented as SEQ ID NO:1, or the complement or a fragment thereof, 

(c) a nucleic acid sequence presented as SEQ ID NO:2, or the complement thereof; 
and 

(d) a nucleic acid sequence that hybridizes, under high stringency conditions to the 
sequence presented as SEQ ID NO:2, or the complement or a fragment thereof, 

wherein said isolated polynucleotide encodes a polypeptide having the biological activity of a 
cellulase and wherein hybridization is conducted at 42°C in 50% formamide, 6X SSC, 5X 
Denhardt's solution, 0.5% SDS and 100 µg/ml denatured carrier DNA followed by washing 
two times in 2X SSPE and 0.5% SDS at room temperature and two additional times in 0.1X 
SSPE and 0.5% SDS at 42°C. 

3. The isolated nucleotide of claim 1 wherein the nucleotide is selected from the group 
mRNA, DNA, cDNA, genomic DNA, and an antisense analog thereof. 

4. The isolated polynucleotide of Claim 3, wherein said polynucleotide is an RNA molecule. 

5. The isolated polynucleotide of claim 1 encoding an enzyme having cellulase activity, 
wherein the enzyme is isolated from a Trichoderma source. 

6. The isolated polynucleotide of Claim 5, wherein the enzyme is isolated from Trichoderma 
reesei. 

7. An expression construct comprising a polynucleotide sequence encoding an amino acid 
sequence having cellulase activity and (i) having at least 85% sequence identity to the amino 
acid sequence presented in SEQ ID NO:3, or (ii) being capable of hybridizing to a probe 
designed to hybridize with the nucleotide sequence disclosed in Figure 2 under conditions of 
intermediate to high stringency, or (iii) being complementary to a nucleotide sequence 
having at least 85% sequence identity to a nucleotide sequence encoding the amino acid 
sequence presented in SEQ ID NO:3 wherein the identity is determined by the CLUSTAL-W 
program in MacVector version 6.5, operated with default parameters, including an open gap 
penalty of 10.0, an extended gap penalty of 0.1, and a BLOSUM 30 similarity matrix. 

8. An expression vector comprising the polynucleotide of Claim 1. 

9. An expression vector comprising an isolated polynucleotide of Claim 1, operably linked to 
control sequences recognized by a host cell transformed with the vector. 

10. An expression vector according to Claim 9 comprising a regulatory polynucleotide 
sequence including a promoter sequence derived from a glucose isomerase gene of 
Actinoplanes, a signal sequence derived from a Streptomyces cellulase gene, and a 
polynucleotide sequence encoding a 029cel cellulase. 

11. A vector comprising the expression construct of Claim 8. 

12. A host cell transformed with the vector of Claim 8. 

13. The host cell of Claim 12, which is a prokaryotic cell. 

14. The host cell of Claim 12, which is a eukaryotic cell. 

15. A substantially purified 029cel polypeptide with the biological activity of a cellulase, 
comprising a sequence selected from the group consisting of: 

(a) an amino acid sequence having at least 85% sequence identity to the amino 
acid sequence presented in Figure 3 (SEQ ID NO:3); 

(b) an amino acid sequence having at least 90% sequence identity to the amino 
acid sequence presented in Figure 3 (SEQ ID NO:3); 

(c) an amino acid sequence having at least 95% sequence identity to the amino 
acid sequence presented in Figure 3 (SEQ ID NO:3); 

(d) an amino acid sequence presented in Figure 3 (SEQ ID NO:3); 

(e) a substantially purified biologically active fragment of the amino acid sequence 
presented as SEQ ID NO:3 

wherein the identity is determined by the CLUSTAL-W program in MacVector version 6.5, 
operated with default parameters, including an open gap penalty of 10.0, an extended gap 
penalty of 0.1, and a BLOSUM 30 similarity matrix. 
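
The identity thresholds recited above (85%, 90% and 95%) can be illustrated with a minimal sketch of the final counting step only. It assumes the two sequences have already been aligned (for example by CLUSTAL-W with the recited parameters) and are supplied as equal-length rows with "-" marking gaps. The function names, the choice of denominator (all aligned columns except those where both rows are gaps) and the single-residue variant used in the example are illustrative assumptions, not part of the claimed method.

```python
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percent identity between two rows of an existing pairwise alignment.

    Assumption: identity = identical residue pairs / aligned columns,
    where columns that are gaps in BOTH rows are ignored. Other
    denominators (e.g. shorter sequence length) are also in common use.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")

    matches = 0
    columns = 0
    for a, b in zip(aligned_a.upper(), aligned_b.upper()):
        if a == gap and b == gap:
            continue  # column carries no information
        columns += 1
        if a == b and a != gap:
            matches += 1

    if columns == 0:
        raise ValueError("alignment has no comparable columns")
    return 100.0 * matches / columns


def meets_threshold(aligned_a: str, aligned_b: str, threshold: float = 85.0) -> bool:
    """True when the aligned pair reaches the given percent identity."""
    return percent_identity(aligned_a, aligned_b) >= threshold


if __name__ == "__main__":
    reference = "MNFSLPIRFF"   # first ten residues shown in Figure 3
    variant = "MNFSLPIKFF"     # hypothetical single-residue variant
    print(percent_identity(reference, variant))       # 90.0
    print(meets_threshold(reference, variant, 95.0))  # False
```

In this toy example nine of ten aligned positions match, so the pair would satisfy the 85% and 90% limits but not the 95% limit.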

16. The substantially purified 029cel cellulase polypeptide or a derivative is provided which 
is obtainable from a Bacillus. 

17. A method of producing a cellulase comprising the steps of: 

(a) culturing the host cell according to claim 12 in a suitable culture medium under 
suitable conditions to produce the cellulase; 

(b) obtaining said produced cellulase. 

18. The method of Claim 17 wherein the host cell is a filamentous fungus or yeast cell. 

19. The method of Claim 17 wherein the host cell is a bacterium. 

20. The method of Claim 19 wherein the bacterium is a Streptomyces. 

21. A purified enzyme having cellulase activity prepared by the method of Claim 17. 

22. A recombinant host cell comprising a deletion or insertion or other alteration in the 
029cel gene which inactivates the gene and prevents 029cel polypeptide production. 

23. An antisense oligonucleotide complementary to a messenger RNA that encodes an 
029cel polypeptide having the sequence presented as SEQ ID NO:3, wherein upon 
exposure to a cellulase-producing host cell, said oligonucleotide decreases or inhibits the 
production of cellulase by said host cell. 

24. The antisense oligonucleotide of Claim 23, wherein the host cell is a filamentous fungus. 

25. A detergent composition, said composition comprising a polypeptide selected from the 
group consisting of: 

(a) an amino acid sequence having at least 85% sequence identity to the amino 
acid sequence presented in Figure 3 (SEQ ID NO:3); 

(b) an amino acid sequence having at least 90% sequence identity to the amino 
acid sequence presented in Figure 3 (SEQ ID NO:3); 

(c) an amino acid sequence having at least 95% sequence identity to the amino 
acid sequence presented in Figure 3 (SEQ ID NO:3); 

(d) an amino acid sequence presented in Figure 3 (SEQ ID NO:3); 

(e) a substantially purified biologically active fragment of the amino acid sequence 
presented as SEQ ID NO:3 

wherein the identity is determined by the CLUSTAL-W program in MacVector version 6.5, 
operated with default parameters, including an open gap penalty of 10.0, an extended gap 
penalty of 0.1, and a BLOSUM 30 similarity matrix. 

26. A detergent composition comprising a surfactant and a cellulase according to Claim 15. 

27. The detergent according to claim 25, wherein said detergent is a laundry detergent. 

28. The detergent according to claim 25, wherein said detergent is a dish detergent. 

29. A feed additive comprising a cellulase according to claim 15. 

30. A method of treating wood pulp comprising contacting said wood pulp with a cellulase 
according to claim 15. 

31. A method of converting biomass to sugars comprising contacting said biomass with a 
cellulase according to claim 15. 

32. The method of Claim 31 further comprising the generation of high fructose corn-syrup. 

33. A method of producing ethanol, said method comprising the steps of: 

(a) contacting a biomass composition with an enzymatic composition comprising 
029cel to yield a sugar solution; 

(b) adding to the sugar solution a fermentative microorganism; and 

(c) culturing the fermentative microorganism under conditions sufficient to produce 
ethanol. 

34. A method of identifying novel enzymes comprising: 

(a) isolating total microbial community DNA from an environment; 

(b) constructing a genomic DNA library in E. coli; 

(c) screening the library for expression of cellulase activity; 

(d) identifying the cellulase gene in the cellulase-positive clone; and 

(e) characterising the novel cellulase enzyme. 

Figure 1A Nucleotide sequence of inserted environmental DNA (029cel). 

SEQ ID NO:1 



ATCAACACGC TGGAAAGTAA TTTCAAGGGT 
GGGTAGAAAT GTGCGGTTGG ATTTCGTTGA 
CCGAGGGCAT AGCGCAGCAG GTTGGCGATG 
GCCGCCTACG ATGTTGTGCT CAGCCGCCCA 
CGGGTTCGCT CGCGGGGGTG AAGAAGACAA 
CCGCTTCCGC TCTCAACGCC GCCGAGGTTG 
CGTTAGGTCC AGCACGGAAT CGCCGACGAG 
TGCGCCCATA GGGTTGGGTA CGCGGAAGGG 
TCATTGGGCG AGACGAGCCG GACAAAGATT 
GCCTTGGAGG GCGAGAGAAA GGTACGTGAG 
GACCGTCTCC CCAGTTGTCG AGATTGAGCC 
GCGATAGTGT AGCTCCAATG GTAGTGACGC 
GGAGAGTTCC TGCAAGCCGT CGCCCCAAAT 
CGTCACGGTA GATCACAAGT TCGGCGGCGG 
TGAGTGATCA CGAGAGTGGC GGTGGCGCTG 
AATGGTGGCG ACGACCGTGT AGCTACCGGG 
AACCGTTGTA GGTAAAGGAG ACGTCAAGCC 
AGAGCGGCCT TGGGGGTGCC GTCGAAAACG 
GATGGTGGCG GGTGCCTTGA GCACAGTCAC 
GATCGTGCGC TGCCGTGTCT GCAGGTGTGA 
GTTCCGGCGG ACGGTGCAAG GCCGGACAGG 
GACGGCGGCT ACTCCGCCGC TCAGGCCGGC 
AGGTGATGGG TGCGGCTGTG GGCCACATCT 
TCGTCACGCA CCGGCATGAG GGCGGAGAGG 
GTAATTGATG TCGGGCTCGG TGATTTCCCA 
CATTCCAATC AAGGTAGGCT TTTTGCACGG 
ATGCTTCCGC TGTATTTGGG CATTGGGACC 
GCGGGACCGT AGAGTGAAGT GAGGGCATTG 
AACCAATGGT GGTAGATTTC ATTGGCTGCA 
CATGTTGCTA AGATAGACCA TGCCCATTGG 
GCAGGTAGCC CATCGCGGCA TCGCGATGCG 
AGCCCAAGCC TCCGTACCCC CTCGAAGAAA 
TTTGTTCGAG CCCCACGTGT AATCCTGATC 
CGTCGGTCTG GTTATTCCAT GCACCGAGAA 
GCCGCCATCC GGTTGCGGAT GTCGGCAGAG 
GAGGGTCGTG TAGTGGGCGA GAGCTTTTTG 
AGAAATACCA CCACTGCACG GGCTCCATAT 
AAATCGCGAT AGACCGCACC GCCCGTGCGC 
CATCACACGG TTGGCTAGCG TATCGTGGGC 
AAGCAAATCC GGTGTTGTCG AAAGGCACAT 
TTCCATGCGG CGATGGCAGC GGATTCGAGG 
GCCTACGCTC TCAAAGACAG TCGCCCCGAG 
TGGCAGTGGC CTCGGTCGAG ACGGGGCCGT 
GTGCTCGGCG GGCTGGCGCT CTGGTGCCCC 
AATAGCCCCG CTCGGCTCCT GCATGCGTAA 
TGACTTCGTC AAGCAGGTCG GGGACACCGT 
AAATCATCGG TAAAGACGTC AGGCCGCCCT 
CAGGATGACG CGCCCCGTCC ACTCGCTGTA 

Figure 1B 

CGAACCAACC GCCGCTGAGA TCGCGCTCCA 
CAGATGGGGC GGCTGGCGAC GTCCTGCGGG 
GTTCGCGTGG GCGTAGGGCA CCTCCTTGGC 
AGAACATGCG CACGGCCTCG CGCAGGACAA 
ATGGCGAAAC TATCGGAATG AGTGTTGTTG 
GTGGCCGGGC TCGGCAACTA CCGTAAAATC 
CCGATTGAAT ATGGATGGCG CCGCCGTTCC 
AAAACCACGA CGCCATCGTT CACGCGACGG 
GCTGTAGCTC TCGGCGCTGT TCCAGCCAAT 
CCACCTTGGT GGCATCGGCG GGGTAACCGA 
TCGGTGTGGG TGGAGGCGAC GAGGGCGGAG 
AAAGCCCGCT GTCGGCCCGA TACCAAAAAA 
TCATAGCAGG ATGTGGATAC GGAAAGGGGG 
GCCCAACGCT TGGCGAAAAC TGGATGGTTG 
TTGAGCCAAA AGCTGCGGGC AATCCTTATT 
ATCGTCGGCG GCACGACTTT TCGATGGGCG 
AGGCGCGAGG CTGCAAACCT TATGAAAAAA 
CCCGGTCAAA ATCCAGTCAA GGTTTGTTCA 
GGCACAGTCG AGCCATCAGC AGTCGCATTG 
TGTGCAAATG ACCGCTGCCG AAGGAACTGT 
CCTCGCCAAG 

Figure 2 

The nucleotide sequence of 029cel ORF 



ATGAATTTTT CTCTCCCTAT TCGTTTTTTT 
CTTTTTCTTG CTGCTCATGG GCAGCTCCGC 
CCGATAAAAT CCGCATCGAC CAATTCGGTT 
GTGGCGGTGA TCGCCGACCC GCAGATTGGC 
CAGCCCCGGC GCAACGCTGG AGGTCCGTCG 
TTTTCTCCGG CTCACCGGTC CCGTGGAACG 
TCGGGAGACC GCGTGTGGTG GTTTGATTTT 
CCACTACCGC ATCCACGATC CTGCCAACAA 
CCATTGGCGC GGATGTTTAC GATGTTGTCC 
TTCTTCTATC AGCGCTCCGG GTTTGCCAAG 
GAACTGGGCC GATGCCGCTT CTCACCCGCA 
TCTGGGATAT GGGGAATGCC TCCTTGGAGC 
TTCGATGCGG GCGATTTCAA CAAGTACAGC 
CCTGGAGCTG CTCCTTGCCT ATCAAGGGCG 
ATTTTGGCAT CCCGGAATCC GGCAACGGTG 
GTCAAATGGG GAATGGACTG GCTCTTACGC 
TATTCTCGGG AAAGTTTCCG TGACGGGGCA 
GCACCGACAC CCATCCGCGT TACTACGGCC 
GCCATGGCTG CCGCCGCTTT CGCCCTCGGG 
AGGCATGAGC GATTATGCCG TCACCCTCGA 
GGAATTGGAC CATGGTCCAT CCTCATGTGC 
GCTTCCGTGA GCCCCTCGCG CAATGCCCAC 
GATGGCCGCC GCCATGCTCT TCGAGCGCAC 
ATTTCTTCGA TGTGCGCTAT CTCGATATGG 
TTCTTCCCCT TTCAAGGTGA GCTACAAAAA 
CCTCCCGGGA GCGACGCCTA GCGTCTCTGC 
CGGCTTCTAT AAACGGTGGG GAGTTTCTCG 
GACGCCTACC GCGCCTACCT GAAGGATCAG 
CAAAACAAAG TCTCAGGCTG GCTTTTTCTT 
GGCTCAACCC CGCCGACGCG GCCGCGCATC 
CTGCACTATC TCCACGGAGT GAACCCAATG 
CATGTATGCC AGCGGCGCTG ACCGTGCAGC 
GGTTCCGCGA TGGCCGGACT GGGACAATGC 
CCGCTCCTGG TTTTCTTTCG GGCGGGTCCC 
GCATTCAGGC GATCCGAGAC CAACCCGTGC 

Figure 3 

The translated 029cel protein composed 



MNFSLPIRFF GIGPTAGFFL LLMGSSALVA 
VAVIADPQIG WNSAESYSPG ATLEVRRVND 
SGDRVWWFDF TWAEPGHYR IHDPANNTHS 
FFYQRSGFAK EVPYAHANWA DAASHPQDVA 
FDAGDFNKYS EWTGRVILEL LLAYQGRPDV 
VKWGMDWLLR MQEPSGAILG KVSVTGHQSA 
AMAAAAFALG ATVFESVGMS DYAVTLESAA 
ASVSFSRNAH DTLANRVMAA AMLFERTGGA 
FFPFQGELQK ALAHYTTLPG ATPSVSADIR 
DAYRAYLKDQ DYTWGSNKTK SQAGFFFEGV 
LHYLHGVNPM GMVYLSNMYA SGADRAANEI 
PLLVFFRAGP NAQIQRKHSG DPRPTRAKSL 



of 581 amino acids 
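
As a consistency check between Figure 2 and Figure 3, the first thirty nucleotides printed in Figure 2 can be translated and compared with the first ten residues printed in Figure 3. The following is a minimal sketch that assumes the standard genetic code (the figures do not state which translation table was used); the function name `translate` and the compact codon-table construction are illustrative choices.

```python
# Standard genetic code, built from the conventional T, C, A, G ordering.
# Assumption: the 029cel ORF is read with the standard code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"   # codons starting with T
    "LLLLPPPPHHQQRRRR"   # codons starting with C
    "IIIMTTTTNNKKSSRR"   # codons starting with A
    "VVVVAAAADDEEGGGG"   # codons starting with G
)
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO_ACIDS)
)


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1; whitespace is ignored.

    Translation stops at the first stop codon, and any trailing
    partial codon is dropped.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


if __name__ == "__main__":
    # First row of Figure 2, as printed.
    first_row = "ATGAATTTTT CTCTCCCTAT TCGTTTTTTT"
    print(translate(first_row))  # MNFSLPIRFF
```

The translated fragment, MNFSLPIRFF, agrees with the first block of the amino acid sequence shown in Figure 3.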